Pinned - Benjamin Marie in Towards Data Science - Mistral 7B: Recipes for Fine-tuning and Quantization on Your Computer - Cheap supervised fine-tuning with an impressive LLM (Oct 26, 2023)
Pinned - Benjamin Marie in Towards Data Science - Run Mixtral-8x7B on Consumer Hardware with Expert Offloading - Finding the right trade-off between memory usage and inference speed (Jan 11)
Benjamin Marie - Piccolo2: Multitask Hybrid Training for Text Embeddings - Exploiting datasets from different types of tasks for training better text embeddings (1d ago)
Benjamin Marie - Sparse Llama: 70% Smaller, 3x Faster, and Full Accuracy - Pruning and short pre-training (May 17)
Benjamin Marie - RWKV-6: Attention-free and State-of-the-art 7B LLM - Especially good for multilingual tasks (May 15)
Benjamin Marie - Fine-tune Tiny Chat Models with Apple OpenELM and ORPO - Can we make a good chat model with a 270M LLM? (May 12)
Benjamin Marie in Towards Data Science - Turn Llama 3 into an Embedding Model with LLM2Vec - RAG with Llama 3 for the generation and the retrieval (May 3)
Benjamin Marie in Towards Data Science - Jamba: The New Hybrid Transformer/Mamba - Faster and better than the transformer but more difficult to train (Apr 30)
Benjamin Marie - Estimate the Memory Consumption of LLMs for Inference and Fine-tuning - A close look at the memory consumption of Command-R+, Mixtral-8x22B, and Llama 3 70B (Apr 27)