Everything You Need to Know About LLaMA

Meta’s latest collection of large language models (LLMs) — a brief summary! Very brief!

Ahmad Abdullah
AI Achilles

--

LLaMA stands for Large Language Model Meta AI.

It is a transformer-based model family released in four sizes: 7B, 13B, 33B, and 65B parameters.

(Not as impressive as a 500B LLM, eh?)

So, why are we even talking about it?

Because LLaMA-13B outperforms GPT-3 (175B) on most benchmarks, despite being more than 10× smaller.

For years, the prevailing assumption in LLM research was that more parameters lead to better performance.

That’s not true anymore!

The best performance is achieved not by the largest models, but by smaller models trained on more data.
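To get an intuition for this trade-off, here is a back-of-the-envelope sketch using the common C ≈ 6·N·D approximation for training compute (FLOPs ≈ 6 × parameters × tokens). The token counts below come from the respective papers (~300B tokens for GPT-3, ~1T for LLaMA-13B); the formula is a rough estimate, not an exact accounting.

```python
# Rough training-compute comparison via the C ≈ 6 * N * D approximation.
def train_flops(params: float, tokens: float) -> float:
    """Approximate total training FLOPs for a dense transformer."""
    return 6 * params * tokens

gpt3 = train_flops(175e9, 300e9)    # GPT-3: 175B params, ~300B tokens
llama13b = train_flops(13e9, 1e12)  # LLaMA-13B: 13B params, ~1T tokens

print(f"GPT-3:     {gpt3:.2e} FLOPs")
print(f"LLaMA-13B: {llama13b:.2e} FLOPs")
print(f"GPT-3 used roughly {gpt3 / llama13b:.1f}x more training compute")
```

Despite seeing far more tokens, LLaMA-13B still used roughly a quarter of GPT-3's estimated training compute, because compute scales with the product of parameters and tokens.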

Why is that significant?

To advance AI, NLP in particular, we need models that are accessible to everyone.

Not every research team can computationally afford to build a 500B parameter model.

LLaMA is positioned as a foundation model that will help democratize access to LLMs and enable researchers to advance the field.

--