Pinned · Benjamin Marie in Towards Data Science · "Mistral 7B: Recipes for Fine-tuning and Quantization on Your Computer" · Cheap supervised fine-tuning with an impressive LLM · Oct 26, 2023
Pinned · Benjamin Marie in Towards Data Science · "Run Mixtral-8x7B on Consumer Hardware with Expert Offloading" · Finding the right trade-off between memory usage and inference speed · Jan 11
Benjamin Marie in Towards Data Science · "Multi-GPU Fine-tuning for Llama 3.1 70B with FSDP and QLoRA" · What you can do with only 2x24 GB GPUs and a lot of CPU RAM · 1d ago
Benjamin Marie · "ThinK: KV Cache Pruning for Memory Efficient Inference" · A promising approach if combined with KV cache quantization · 1d ago
Benjamin Marie in Towards Data Science · "Serve Multiple LoRA Adapters with vLLM" · Without any increase in latency · 5d ago
Benjamin Marie · "More Evidence that Ternary LLMs Are Good Enough" · -1, 0, and 1 are all you need to make good LLMs · Jul 25
Benjamin Marie in Towards Data Science · "Function Calling: Fine-Tuning Llama 3 on xLAM" · Fast and memory-efficient thanks to QLoRA · Jul 23
Benjamin Marie · "Q-GaLore: Train LLMs from Scratch with a 16 GB GPU" · GaLore but with quantization · Jul 21
Benjamin Marie · "Data Contamination for LLM Code Benchmarking, Can We Avoid It?" · Probably not, but we can try. · Jul 19
Benjamin Marie · "Fine-tune Gemma 2 on Your Computer with LoRA and QLoRA" · Using Hugging Face libraries and Unsloth · Jul 16