Elon Musk's Grok 1.5 Vision MultiModel Is Here-Open Source Better Than Open AI And Google Gemini Pro
Krish Naik Krish Naik
946K subscribers
10,449 views
0

 Published On Apr 18, 2024

Introducing Grok-1.5V, our first-generation multimodal model. In addition to its strong text capabilities, Grok can now process a wide variety of visual information, including documents, diagrams, charts, screenshots, and photographs. Grok-1.5V will be available soon to our early testers and existing Grok users.

Capabilities
Grok-1.5V is competitive with existing frontier multimodal models in a number of domains, ranging from multi-disciplinary reasoning to understanding documents, science diagrams, charts, screenshots, and photographs. We are particularly excited about Grok’s capabilities in understanding our physical world. Grok outperforms its peers in our new RealWorldQA benchmark that measures real-world spatial understanding. For all datasets below, we evaluate Grok in a zero-shot setting without chain-of-thought prompting.
-------------------------------------------------------------------------------------------------
Support me by joining membership so that I can upload these kind of videos
   / @krishnaik06  
-----------------------------------------------------------------------------------
Fresh Langchain Playlist:    • Fresh And Updated Langchain Series- U...  
►LLM Fine Tuning Playlist:    • Steps By Step Tutorial To Fine Tune L...  
►AWS Bedrock Playlist:    • Generative AI In AWS-AWS Bedrock Cras...  
►Llamindex Playlist:    • Announcing LlamaIndex Gen AI Playlist...  

►Google Gemini Playlist:    • Google Is On Another Level- Check Out...  
►Langchain Playlist:    • Amazing Langchain Series With End To ...  
►Data Science Projects:
   • Now you Can Crack Any ML Interviews- ...  

►Learn In One Tutorials

Statistics in 6 hours:    • Complete Statistics For Data Science ...  

End To End RAG LLM APP Using LlamaIndex And OpenAI- Indexing And Querying Multiple Pdf's

Machine Learning In 6 Hours:    • Complete Machine Learning In 6 Hours|...  

Deep Learning 5 hours :    • Deep Learning Indepth Tutorials In 5 ...  

►Learn In a Week Playlist

Statistics:   • Live Day 1- Introduction To statistic...  

Machine Learning :    • Announcing 7 Days Live Sessions On Ma...  

Deep Learning:   • 5 Days Live Deep Learning Community S...  

NLP :    • Announcing NLP Live community Sessions  
---------------------------------------------------------------------------------------------------
My Recording Gear
Laptop: https://amzn.to/4886inY
Office Desk : https://amzn.to/48nAWcO
Camera: https://amzn.to/3vcEIHS
Writing Pad:https://amzn.to/3OuXq41
Monitor: https://amzn.to/3vcEIHS
Audio Accessories: https://amzn.to/48nbgxD
Audio Mic: https://amzn.to/48nbgxD

show more

Share/Embed