Sitemap

Llama 3.2 Vision, the new multi-modal LLM by Meta

4 min readSep 26, 2024

--

Photo by DAVIS VARGAS on Unsplash

Multimodal Capabilities

This is just too good !!

Model Variants

Most of the Chatbot UIs we use like ChatGPT, Perplexity are usually instruction fine-tuned

Architecture

What is an image adapter?

What is a Vision tower?

Evaluations and metrics

Llama Guardrails

Where to access?

How to run it locally?

Hope this was useful, and you try out Llama3.2 shortly!!

--

--

Responses (2)