Vyacheslav Efimov in Towards Data Science

- Large Language Models, ALBERT — A Lite BERT for Self-supervised Learning. Understand the essential techniques behind BERT's architecture choices for producing a compact and efficient model. (Nov 7, 2023)
- Large Language Models, GPT-3: Language Models are Few-Shot Learners. Efficiently scaling GPT from large to titanic magnitudes within the meta-learning framework. (Feb 16)
- Large Language Models, MirrorBERT — Transforming Models into Universal Lexical and Sentence… Discover how mirror augmentation generates data and improves BERT's performance on semantic similarity tasks. (Dec 12, 2023)
- Large Language Models, GPT-2 — Language Models are Unsupervised Multitask Learners. Advancing GPT's capabilities by turning it into a powerful multitask zero-shot model. (Feb 10)
- Large Language Models, GPT-1 — Generative Pre-Trained Transformer. Diving deeply into the working structure of the first version of the gigantic GPT models. (Jan 27)
- Large Language Models: DeBERTa — Decoding-Enhanced BERT with Disentangled Attention. Exploring an advanced version of the attention mechanism in Transformers. (Nov 28, 2023)
- Large Language Models, StructBERT — Incorporating Language Structures into Pretraining. Making models smarter by incorporating better learning objectives. (Nov 22, 2023)
- Large Language Models: TinyBERT — Distilling BERT for NLP. Unlocking the power of Transformer distillation in LLMs. (Oct 21, 2023)
- Large Language Models: DistilBERT — Smaller, Faster, Cheaper and Lighter. Unlocking the secrets of BERT compression: a student-teacher framework for maximum efficiency. (Oct 7, 2023)
- Large Language Models: RoBERTa — A Robustly Optimized BERT Approach. Learn about key techniques used for BERT optimisation. (Sep 24, 2023)
- Large Language Models: SBERT — Sentence-BERT. Learn how Siamese BERT networks accurately transform sentences into embeddings. (Sep 12, 2023)
- Large Language Models: BERT — Bidirectional Encoder Representations from Transformers. Understand how BERT constructs state-of-the-art embeddings. (Aug 30, 2023)