Stanislav Fedotov in Nebius
How transformers, RNNs and SSMs are more alike than you think
By uncovering surprising links between seemingly unrelated LLM architectures, a way might be paved for effective idea exchange and boosting…
Sep 6
Stanislav Fedotov in Nebius
Mixtures of Experts and scaling laws
Mixture of Experts (MoE) has become popular as an efficiency-boosting architectural component for LLMs. In this blog post, we’ll explore…
Aug 13
Stanislav Fedotov in Nebius
Fundamentals of LoRA and low-rank fine-tuning
In the next installment of our series of deep technical articles on AI research, let’s switch our attention to the famous LoRA, a low-rank…
Jun 17
Stanislav Fedotov in Nebius
Transformer alternatives in 2024
With this article, we are starting a new category on our blog, the one dedicated to AI research. Expect these posts to be very technical…
Apr 4