What NVIDIA’s New Bet Reveals About AI.

The First of Many Specialized Chips

9 min read · Sep 16, 2025

Source: Author using GPT-5

NVIDIA has finally presented its first-ever inference-only GPU, explicitly designed to run AI models, not train them.

It’s a dramatic shift from general-purpose GPUs to specialized ones, making Rubin, NVIDIA’s next GPU generation, its first-ever disaggregated inference platform.

This is a huge deal because it’s a response to the threat that companies like Cerebras or Groq pose to its dominance, but it’s also a very bold and concentrated bet that could backfire.

By reviewing the essence of AI workloads, and particularly what “AI inference” actually means, we can evaluate the consequences for the $4 trillion company and for the industry as a whole. The move reveals more than meets the eye, especially about which AI models will dominate the future.

Finally.

Ignacio de Gregorio

Written by Ignacio de Gregorio

I break down AI in easy-to-understand language for you. Sign up here: https://thewhitebox.beehiiv.com/subscribe Business inquiries: nacho@thewhitebox.ai