Homepage
Open in app
Sign in
Get started
Nebius
AI-centric cloud for ML practitioners
Follow
Creating images with Flux: Your prompt guide
Creating images with Flux: Your prompt guide
With this complete prompt guide, you will learn practical tips for creating exceptional images with Flux on Nebius AI Studio. I will share…
Dylan Bristot
Jun 17
How we streamlined HR operations with an AI assistant (and why you should too)
How we streamlined HR operations with an AI assistant (and why you should too)
Repetitive employee queries can overwhelm HR departments and reduce time for strategic tasks. At Nebius, we implemented an AI Assistant…
Akim Tsvigun
May 1
Creating a smart cleaning robot: How Positronic Robotics uses Nebius
Creating a smart cleaning robot: How Positronic Robotics uses Nebius
Every year, people around the world spend up to 800 billion person-hours cleaning. It’s one of the most disliked chores, and businesses…
Sviatoslav Ivanov
Apr 24
Chinese AI New Year started early
Chinese AI New Year started early
If you think New Year celebrations started on January 1, it depends on how you look at it — because in the AI world, the real fireworks…
Prof. Dr. Ivan Yamshchikov
Jan 27
Eliminating hallucinations in LLM training: are we there yet?
Eliminating hallucinations in LLM training: are we there yet?
LLM hallucinations are model outputs that appear ungrounded, inconsistent or nonsensical — a real bane for any AI developer. It’s quite…
Stanislav Fedotov
Dec 17, 2024
Explaining Soperator, Nebius’ open-source Kubernetes operator for Slurm
Explaining Soperator, Nebius’ open-source Kubernetes operator for Slurm
We’ve built an open-source Kubernetes operator that runs and manages Slurm clusters as K8s resources. From this article, you’ll find out…
Mikhail Mokrushin
Sep 24, 2024
How transformers, RNNs and SSMs are more alike than you think
How transformers, RNNs and SSMs are more alike than you think
By uncovering surprising links between seemingly unrelated LLM architectures, a way might be paved for effective idea exchange and boosting…
Stanislav Fedotov
Sep 6, 2024
Cloud architects team: lessons learned
Cloud architects team: lessons learned
It’s a little quieter now — as is often the case in August — in terms of big AI conferences, announcements, and product launches. Everyone…
Levon Sarkisyan
Aug 28, 2024
Choosing storage for deep learning: a comprehensive guide
Choosing storage for deep learning: a comprehensive guide
The rapid evolution of deep learning models has brought about unprecedented growth in both their size and complexity. This trend, while…
Igor Ofitserov
Aug 15, 2024
Mixtures of Experts and scaling laws
Mixtures of Experts and scaling laws
Mixture of Experts (MoE) has become popular as an efficiency-boosting architectural component for LLMs. In this blog post, we’ll explore…
Stanislav Fedotov
Aug 13, 2024
How large models can abstract rules: a research by LIMS
How large models can abstract rules: a research by LIMS
How well can LLMs abstract rules and how to test such ability? Here’s a research by the London Institute for Mathematical Sciences (LIMS).
Leonid Kliuev
Aug 12, 2024
In-house LLM R&D: Nebius AI’s secret ingredient for truly AI‑centric cloud
In-house LLM R&D: Nebius AI’s secret ingredient for truly AI‑centric cloud
Severe GPU scarcity and struggles with MLOps are forcing ML engineers to more and more divert focus from model development. Such a shift…
Anastasia Zemskova
Jul 30, 2024
Data preparation for LLMs: techniques, tools and our established pipeline
Data preparation for LLMs: techniques, tools and our established pipeline
Why are datasets for LLMs so challenging? As with any machine learning task, data is half the battle (the other half being model efficiency…
Yury Anapolskiy
Jun 27, 2024
Fundamentals of LoRA and low-rank fine-tuning
Fundamentals of LoRA and low-rank fine-tuning
In the next installment of our series of deep technical articles on AI research, let’s switch our attention to the famous LoRA, a low-rank…
Stanislav Fedotov
Jun 17, 2024
Slurm vs Kubernetes: Which to choose for your ML workloads
Slurm vs Kubernetes: Which to choose for your ML workloads
Scaling your machine learning workloads will eventually require resource orchestration. Among the multiple solutions available, the most…
Panu Koskela
Jun 10, 2024
Demo: applying retrieval-augmented generation with open tools
Demo: applying retrieval-augmented generation with open tools
Retrieval-augmented generation (RAG) is a technique that enhances language models by combining generative AI with a retrieval component…
Roman Bunin
Apr 18, 2024
Transformer alternatives in 2024
Transformer alternatives in 2024
With this article, we are starting a new category on our blog, the one dedicated to AI research. Expect these posts to be very technical…
Stanislav Fedotov
Apr 4, 2024
Tips and tricks for performing large model checkpointing
Tips and tricks for performing large model checkpointing
There are various aspects to optimize when training large models. It often lasts weeks and involves managing billions of rows of data, with…
Simon Karasik
Mar 12, 2024
Joining AI research community: overview for industry experts
Joining AI research community: overview for industry experts
The global network of ML engineers is divided into two parts: industrial and academic. The flow of information, methods of interaction, and…
Leonid Kliuev
Feb 21, 2024
About Nebius
Latest Stories
Archive
About Medium
Terms
Privacy
Teams