Instruction backtranslation is a scalable method to build a high-quality instruction-following language…
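A minimal sketch of the backtranslation loop this describes: an unlabeled document is turned into an (instruction, response) pair by generating a candidate instruction for it, then the pair is self-curated by a quality score. The helpers `generate_instruction` and `score_pair` are hypothetical stand-ins, not the paper's actual API.

```python
from typing import Callable, List, Tuple

def backtranslate(
    documents: List[str],
    generate_instruction: Callable[[str], str],  # self-augmentation step
    score_pair: Callable[[str, str], float],     # self-curation step
    threshold: float = 4.0,                      # keep only high-scoring pairs
) -> List[Tuple[str, str]]:
    """Turn unlabeled documents into (instruction, response) training pairs."""
    curated = []
    for doc in documents:
        # Predict an instruction for which this document is a good response.
        instruction = generate_instruction(doc)
        # Rate the candidate pair (e.g., on a 1-5 scale) and keep the best.
        if score_pair(instruction, doc) >= threshold:
            curated.append((instruction, doc))
    return curated

# Toy usage with stand-in callables:
pairs = backtranslate(
    ["Photosynthesis converts light energy into chemical energy..."],
    generate_instruction=lambda doc: "Explain how photosynthesis works.",
    score_pair=lambda ins, doc: 5.0,
)
print(pairs)
```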
LLaMA 2 Long is a series of long-context LLMs built through continual pretraining from LLaMA 2 with…
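One ingredient reported for this long-context recipe is adjusting the base frequency of RoPE positional embeddings (from 10,000 to a much larger value, 500,000) so rotations are slower and positions remain distinguishable at long range; the sketch below only computes the rotation angles to show that effect, and is illustrative rather than the paper's implementation.

```python
import numpy as np

def rope_angles(positions: np.ndarray, dim: int, base: float) -> np.ndarray:
    """RoPE rotation angles for each (position, frequency) pair."""
    inv_freq = 1.0 / (base ** (np.arange(0, dim, 2) / dim))
    return np.outer(positions, inv_freq)  # shape: (seq_len, dim // 2)

pos = np.arange(32768)                              # a long-context window
short = rope_angles(pos, dim=128, base=10_000.0)    # original LLaMA 2 base
long_ = rope_angles(pos, dim=128, base=500_000.0)   # adjusted base frequency
# The lowest-frequency channel rotates far less with the larger base:
print(short[-1, -1], long_[-1, -1])
```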
As larger models require pretraining on trillions of tokens, it is unclear how scalable the curation of…
Large language models are trained in two stages: (1) unsupervised pretraining from raw text, to learn…
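A sketch of how the two stages differ in their training inputs: pretraining computes next-token loss over every position of raw text, while instruction tuning formats (instruction, response) pairs and typically masks the loss on the prompt tokens. The whitespace tokenizer and `###` markers are illustrative assumptions, not any particular library.

```python
def tokenize(text: str) -> list:
    return text.split()

# Stage 1: unsupervised pretraining — every token of raw text is a target.
raw = tokenize("Large language models learn broad knowledge from raw text .")
pretrain_example = {"tokens": raw, "loss_mask": [1] * len(raw)}

# Stage 2: instruction tuning — loss applies only to the response tokens.
prompt = tokenize("### Instruction: Summarize the paragraph . ### Response:")
response = tokenize("The paragraph explains two-stage training .")
finetune_example = {
    "tokens": prompt + response,
    "loss_mask": [0] * len(prompt) + [1] * len(response),
}
print(finetune_example["loss_mask"])
```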
PaLM 2 is the successor to PaLM. It's more compute-efficient and is pre-trained on a more multilingual &…
Alpaca is fine-tuned from Meta’s LLaMA 7B model. The Alpaca model is trained on 52K instruction-following…
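To make the 52K demonstrations concrete, here is a sketch of how Alpaca-style records are rendered into fine-tuning prompts. The template follows the form published in the Alpaca repo (there is also a variant for records without an `input` field); treat the exact wording here as illustrative.

```python
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n"
)

def format_example(example: dict) -> str:
    """Render one training record into the fine-tuning prompt."""
    return ALPACA_TEMPLATE.format(
        instruction=example["instruction"], input=example.get("input", "")
    )

print(format_example({
    "instruction": "Classify the sentiment of the sentence.",
    "input": "I loved this movie!",
}))
```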