Ritvik Rastogi – Medium

Ritvik Rastogi

Pinned

Ritvik Rastogi

Thanks for the appreciation. It's surreal for me to be acknowledged by the author themselves.

1 min read · Feb 1, 2024

Papers Explained 147: LongLoRA

LongLoRA is an efficient fine-tuning approach that extends the context sizes of pre-trained LLMs at limited computation cost.

4 min read · 9 hours ago

Papers Explained 146: QLoRA

QLoRA is an efficient fine-tuning approach that reduces memory usage for fine-tuning large models on a single GPU while preserving full…

6 min read · 2 days ago

Papers Explained 145: LoRA

Low-Rank Adaptation, or LoRA, freezes the pretrained model weights and injects trainable rank decomposition matrices into each layer of the…

5 min read · 4 days ago

Papers Explained 144: Granite Code Models

This paper introduces a series of decoder-only code models (3B, 8B, 20B, 34B) for code generation tasks, trained with code written in 116…

10 min read · May 31, 2024

Papers Explained 143: Chameleon

Chameleon is a family of early-fusion token-based mixed-modal models capable of reasoning over and generating interleaved image-text…

8 min read · May 29, 2024

Papers Explained 142: Gemini 1.5 Flash

The tech report introduces two new models: Gemini 1.5 Pro and Gemini 1.5 Flash.

16 min read · May 27, 2024

Papers Explained 141: Tool LLM

Open-source LLMs struggle with tasks that require interaction with external tools or APIs. To address this limitation, this paper…

6 min read · May 24, 2024

Papers Explained 140: Toolformer

Toolformer is a model trained to decide which APIs to call, when to call them, what arguments to pass, and how to best incorporate the…

6 min read · May 22, 2024

Papers Explained 139: Gorilla

Gorilla is a retrieval-aware fine-tuned LLaMA-7B model built specifically for API calls. It substantially mitigates the issue of hallucination…

6 min read · May 20, 2024

Ritvik Rastogi

Data Scientist, 2x Kaggle Expert
