Victor Sanh in HuggingFace, "🏎 Smaller, faster, cheaper, lighter: Introducing DistilBERT, a distilled version of BERT" (Aug 28, 2019): You can find the code to reproduce the training of DistilBERT along with pre-trained weights for DistilBERT here.
Vyacheslav Efimov in Towards Data Science, "Large Language Models: TinyBERT — Distilling BERT for NLP" (Oct 21, 2023): Unlocking the power of Transformer distillation in LLMs.
A.S. Reisfield, "Comparing Media of Art: Perfume Wins Again" (Apr 2): There is no art left to create. There are no isms of art left unclaimed. Possibilities to recycle used fragments, they're exhausted. The…
Aaditya Ura, "Quantization vs Distillation in Neural Networks: A Comparison" (Nov 11, 2023): A dive into the techniques of quantizing and distilling deep learning models: what are they and how do they differ?
Igor Novikov in DataDrivenInvestor, "Tremendously increasing models performance using distillation" (Mar 14): Have you ever wondered if a human brain has limited capacity and you can't learn that damn English because you remember too many Pokemons? Or…
Remi Ouazan Reboul in Towards Data Science, "Distillation of BERT-like models: the code" (Jan 24, 2022): How to implement DistilBERT's method to distill any BERT-like model.