Kevin François – Medium

Kevin François

Kevin François
in
neoxia

LLm infini-attention with linear complexity

Introducing Google’s Infini-attention to increase LLM attention windows and reduce quadratic complexity

12 min readApr 26, 2024

--

LLm infini-attention with linear complexity

--

Kevin François
in
neoxia

Deep dive in embeddings

Full explanation of word and image embedding with the explanation of reference models such as Bert and Clip

15 min readMar 21, 2024

--

Deep dive in embeddings

--

Kevin François
in
neoxia

Mixtral 8x7B explained

This review aims to expound upon the MoE framework and explore the added benefits that Mixtral 8x7B brings to these specific fie

7 min readMar 21, 2024

--

Mixtral 8x7B explained

--

Kevin François
in
neoxia

Proxy-Tuning: A Breakthrough in Customizing Large Language Models

Fine-tune LLM through next token probability

4 min readMar 18, 2024

--

Proxy-Tuning: A Breakthrough in Customizing Large Language Models

--

Kevin François
in
neoxia

The Era of 1-bit LLMs

Introduction: Deep dive in LLM quantization

7 min readMar 13, 2024

--

The Era of 1-bit LLMs

--

Kevin François

Kevin François

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams