azharinazhar labsBuilding Mamba from Scratch: A Comprehensive Code WalkthroughIn the realm of deep learning, sequence modeling remains a challenging task, often tackled by models such as LSTMs and Transformers…Dec 29, 20235Dec 29, 20235
Ogban UgotNotes on fine-tuning Llama 2 using QLoRA: A detailed breakdownDetailed notes include the relevant open-source libraries, the important Classes and Methods, and the theoretical techniques they…Sep 19, 20232Sep 19, 20232
Cameron R. Wolfe, Ph.D.LLaMA-2 from the Ground UpEverything you need to know about the best open-source LLM on the market…Dec 20, 2023Dec 20, 2023
Aki KutvonenHow to train sentencepiece tokenizers with common crawl (multilanguage)Introducing a set of common crawl pre-trained sentencepiece tokenizers for Japanese and English, and and a codebase to train more for…Oct 21, 2021Oct 21, 2021
Michael PhiinTowards Data ScienceIllustrated Guide to Transformers- Step by Step ExplanationTransformers are taking the natural language processing world by storm. These incredible models are breaking multiple NLP records and…Apr 30, 202021Apr 30, 202021
Chamanth mvsDecoder-only Transformer modelUnderstanding Large Language models with GPT-1Jun 18, 20233Jun 18, 20233
Abonia SojasingarayarBest LLM and LLMOps Resources for 2023Curated list of best courses, books, resources on large language modelMay 19, 20234May 19, 20234