Tushar Madaan – Medium

Tushar Madaan

Home

Lists

About

Tushar Madaan

Reproducing GPT-2 (124M): Key Insights and Techniques

In this blog, we’ll explore the key details of reproducing the GPT-2 (124M) model, focusing on its architecture, training process, and…

4d ago

Reproducing GPT-2 (124M): Key Insights and Techniques

4d ago

Tushar Madaan

Navigating Model Drift: A Systems Approach

Imagine a small town of Univille, with only one university where exactly 100 students apply each year. Each student is tested on 2 exams…

May 30

Navigating Model Drift: A Systems Approach

May 30

Tushar Madaan

Bayesian thinking , reward systems and the computational inefficiency of skepticism.

Even if we start with small biases(unbalanced priors), our priors creep into our sampling of the world and thus have likelihood to drive…

Nov 4, 2015

Nov 4, 2015

Tushar Madaan

Tushar Madaan

Following

Murilo Gustineli
Robby Kiskanyan
Artem Shelamanov
Louis-François Bouchard
Benedict Neo

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams