SteinwayHow to use Transformer Reinforcement Learning(TRL) for fine-tune GPT-2 with PyTorchIn this article, reinforcement learning training method like SFT and DPO will be introduced to everyone. For the basic fine-tuning in…6d ago
Amit KharelinTowards AIHow I built an LLM that can Sing (From Scratch) (Part 2)Build and Train a 29M GPT-2 Model from scratchJun 301
ANSHUL SHIVHAREHarnessing the Power of Soft Prompts: A Hands-On Guide to Fine-Tuning for Text SummarizationIn our previous blog, we delved into the concept of Soft Prompts and their role in enhancing the performance of Large Language Models…May 10May 10
SteinwayHow to deploy your GPT-2 model into Android PhoneThis article will use two methods to introduce how to convert a gpt2 model into a tflite model and then embed it into an Android phone for…Oct 22Oct 22
SteinwayHow to use Transformer Reinforcement Learning(TRL) for fine-tune GPT-2 with PyTorchIn this article, reinforcement learning training method like SFT and DPO will be introduced to everyone. For the basic fine-tuning in…6d ago
Amit KharelinTowards AIHow I built an LLM that can Sing (From Scratch) (Part 2)Build and Train a 29M GPT-2 Model from scratchJun 301
ANSHUL SHIVHAREHarnessing the Power of Soft Prompts: A Hands-On Guide to Fine-Tuning for Text SummarizationIn our previous blog, we delved into the concept of Soft Prompts and their role in enhancing the performance of Large Language Models…May 10
SteinwayHow to deploy your GPT-2 model into Android PhoneThis article will use two methods to introduce how to convert a gpt2 model into a tflite model and then embed it into an Android phone for…Oct 22
RupakHow to Fine-Tune GPT-2 for a Domain-Specific ChatbotIf you want a chatbot that responds specifically within your domain, you can fine-tune GPT-2 for that purpose which has only 124 million…Aug 17
SteinwayFine tune your own GPT-2 model with PyTorch for generate textIn this story, we will introduce how to fine-tune GPT-2 from transformers using Torch CUDA(overall installation steps). In addition, well…Oct 21
Matthew GuntoninTowards Data ScienceLine By Line, Let’s Reproduce GPT-2: Section 1This blog post will go line-by-line through the code in Section 1 of Andrej Karpathy’s “Let’s reproduce GPT-2 (124M)”Jul 23