Replicating GPT2–1.5B

Connor Leahy
Jun 6

The important facts

The story in brief

Issues

Where my model differs from the original GPT2–1.5B

A few thoughts on my experiences

Some Pro Tips
