Siphumelelo Talent Qwabe - EgoPlan-Bench2: Is This the Key to Unlocking Artificial General Intelligence? - In the rapidly evolving field of artificial intelligence (AI), the pursuit of artificial general intelligence (AGI) has been significantly… - 3d ago
Kushagra Misra - Era of Light Weight LLM Models: Innovations and Mobile Possibilities - Hi fellow readers… The recent emergence of Multimodal Large Language Models (MLLMs) has significantly advanced both AI research and… - Aug 31
Joyce Birkins - Agent S: Controlling Computers through Conversation, Currently at a Success Rate Below 30% - The research on RAG has concluded, and recently, the open-source project “Agent S” emerged over the past four days, prompting a deep dive… - Oct 17
In Towards Data Science by Youness Mansar - 6 Real-World Uses of Microsoft’s Newest Phi-3 Vision-Language Model - Exploring possible use cases of Phi-3-Vision, a small yet powerful MLLM that can be run locally (with code examples) - May 24
Rajesh Mani Kumar G - OpenBMB MiniCPM-V MLLM Tiny & Mighty Multimodal LLM - MiniCPM-V 2.6 from Openbmb is built on SigLip-400M and Qwen2–7B with a total of 8B parameters. With many new features like Multi Image… - Aug 11
maadaa.ai - Multimodal Large Language Model Dataset | Image-Text Pairs Dataset | LAION 5B - maadaa.ai’s New Image-Text Pairs Dataset for Commercial Multimodal Large Language Models - Apr 18
In Towards Data Science by Youness Mansar - A Simple Recipe to Boost the Performance of MLLMs on Your Custom Use Case - An MLLM QLoRA fine-tuning tutorial using the newest pocket-sized Mini-InternVL model - Jun 11
Memoona Tahira - PDF Parsing options in 2024 for RAG based Chatbot - Overview of Open-Source Options for Document Parsing and Data Ingestion - Nov 11