Sign in Get started

Nebius

AI-centric cloud for ML practitioners

In-house LLM R&D: Nebius AI’s secret ingredient for truly AI‑centric cloud

In-house LLM R&D: Nebius AI’s secret ingredient for truly AI‑centric cloud

Severe GPU scarcity and struggles with MLOps are forcing ML engineers to more and more divert focus from model development. Such a shift…

Anastasia Zemskova

Jul 30

Data preparation for LLMs: techniques, tools and our established pipeline

Data preparation for LLMs: techniques, tools and our established pipeline

Why are datasets for LLMs so challenging? As with any machine learning task, data is half the battle (the other half being model efficiency…

Yury Anapolskiy

Jun 27

Fundamentals of LoRA and low-rank fine-tuning

Fundamentals of LoRA and low-rank fine-tuning

In the next installment of our series of deep technical articles on AI research, let’s switch our attention to the famous LoRA, a low-rank…

Stanislav Fedotov

Jun 17

Slurm vs Kubernetes: Which to choose for your ML workloads

Slurm vs Kubernetes: Which to choose for your ML workloads

Scaling your machine learning workloads will eventually require resource orchestration. Among the multiple solutions available, the most…

Jun 10

Demo: applying retrieval-augmented generation with open tools

Demo: applying retrieval-augmented generation with open tools

Retrieval-augmented generation (RAG) is a technique that enhances language models by combining generative AI with a retrieval component…

Apr 18

Transformer alternatives in 2024

Transformer alternatives in 2024

With this article, we are starting a new category on our blog, the one dedicated to AI research. Expect these posts to be very technical…

Stanislav Fedotov

Apr 4

Tips and tricks for performing large model checkpointing

Tips and tricks for performing large model checkpointing

There are various aspects to optimize when training large models. It often lasts weeks and involves managing billions of rows of data, with…

Mar 12

Joining AI research community: overview for industry experts

Joining AI research community: overview for industry experts

The global network of ML engineers is divided into two parts: industrial and academic. The flow of information, methods of interaction, and…

Feb 21

Which AI conferences to attend in 2024?

Which AI conferences to attend in 2024?

The beginning of the year is a good time to make plans, right? If you’re an ML engineer, researcher, or technical manager, now is the…

Jan 24

NVIDIA H100 and other GPUs — which are relevant for your ML workload?

NVIDIA H100 and other GPUs — which are relevant for your ML workload?

My name is Igor, I’m the Technical Product Manager for IaaS at Nebius AI. Today, I’m going to break down the differences between NVIDIA’s…

Nov 21, 2023

About NebiusLatest StoriesArchiveAbout MediumTermsPrivacyTeams