NVIDIA’s GTC 2024: NVIDIA Redefines the Boundaries of AI Possibility

Siddharth Sudhakar
Accredian
Published in
6 min readMar 28, 2024
Jensen Huang Introduces Project GR00T — NVIDIA GTC 2024

“The opening scene of every robot horror movie ever made.” — A netizen’s reaction to Jensen Huang unveiling the humanoid robots at GTC 2024

Introduction

If you like to keep up with technological advancements, you must have heard about the recent NVIDIA conference. NVIDIA is a technology company that has been around for quite a while. It was known primarily for its Graphics Processing Units (GPUs), but recently, NVIDIA has become a global leader in AI. To showcase its advancements in AI, NVIDIA hosts an annual GTC conference, which brings together the brightest minds in the field.

On March 18, 2024, Jensen Huang, the founder and CEO of NVIDIA, delivered the conference’s keynote, where he made some significant announcements regarding the advancements at NVIDIA related to AI that are game-changers for the industry. In this article, we’ll delve into the details of some of these advancements and explore what they mean for the future of AI.

Table of Contents:

  1. Blackwell
  2. NVIDIA Inference Microservice
  3. AI Foundry
  4. Omniverse
  5. Project GR00T
  6. Conclusion and Future Implications

Blackwell

A size comparison between Blackwell (left) and its predecessor, Hopper (right)

The Blackwell chip is a GPU microarchitecture that was announced at GTC 2024. It’s a game-changer in the field of AI. The design and organization of a GPU’s components is known as its microarchitecture. This microarchitecture is crucial to the GPU’s performance, power efficiency, and capabilities.

Further, to understand how GPUs are used in AI, it’s essential to first understand the difference between CPUs and GPUs. CPUs are designed for sequential processing, while GPUs are designed for parallel processing. This makes GPUs ideal for the high computational demands of AI. They can process thousands of threads simultaneously, which is particularly beneficial for tasks like training neural networks that require simultaneous processing of large amounts of data.

Now, let’s look at some key specifications of the Blackwell chip that make it impressive.

  • At the heart of Blackwell, there are 208 billion transistors! I had to check multiple sources to make sure this number was correct. This now makes the Blackwell the GPU with the highest number of transistors.
  • It provides 20 petaflops from a single GPU. A petaflop is 1,000 trillion (or one quadrillion) calculations per second.
  • There’s 192GB of memory with up to 8 TB/s of bandwidth.
  • With the fifth-generation NVIDIA NVLink interconnect in Blackwell, we can scale up to 576 GPUs.

Before Blackwell, a chip called Hopper was NVIDIA’s flagship chip for AI. To compare the training and inference performance of Blackwell and Hopper, Blackwell delivers up to 4 times the training performance and up to 30 times the inference performance compared to Hopper.

NVIDIA Inference Microservice

NVIDIA Inference Microservices (NIM) is a new way of packaging and delivering software, which allows developers to connect with millions of GPUs to deploy customized AI. It encapsulates NVIDIA’s extensive work on model inference and optimization, and this packaged solution is then made available as a microservice, enhancing its accessibility and ease of use.

This service reduces deployment times from weeks to mere minutes. It was announced that the company is partnering with Microsoft, Amazon, and Google to make NIM microservices available on Azure AI, SageMaker, and Kubernetes Engine.

AI Foundry

The AI Foundry Service Architecture

During the introduction of this section, Jensen described NVIDIA as an AI Foundry. This concerns how other companies can come to NVIDIA with their ideas and requirements, and NVIDIA will provide the end-to-end AI backend to make it work.

NVIDIA aims to help businesses and startups create and fine-tune customized generative AI applications using MS Azure. It allows developers to train and deploy their generative AI models quickly and effectively. Many well-known companies, such as SAP, ServiceNow, Cohesity, Snowflake, NetApp, and Dell, have already started developing AI-powered applications and systems with the help of this service.

Omniverse

“We need a simulation engine that represents the world digitally for the robot so that the robot has a gym to go learn how to be a robot. We call that virtual world Omniverse.” — Jensen Huang.

The Omniverse platform is a digital twin simulation and collaboration tool that operates in real time. It enables designers, developers, and engineers to create, test, and iterate on virtual designs in a shared virtual space.

NVIDIA’s Omniverse

At GTC 2024, NVIDIA announced Omniverse Cloud APIs. These APIs extend the reach of the Omniverse platform for creating industrial digital twin applications and workflows across the entire ecosystem of software makers.

Omniverse offers real-time rendering capabilities, comprehensive support, and interoperability, making it a powerful tool for visualizing complex data, simulating physical phenomena, and creating immersive virtual environments. Whether designing a new video game, simulating a city’s infrastructure, or making a digital twin of a physical product, Omniverse is set to revolutionize 3D workflows.

Project GR00T

During the event at GTC 2024, NVIDIA unveiled Project GR00T, a general-purpose foundation model intended for humanoid robot learning. Project GR00T has the capability of taking in a variety of instructions and past interactions and providing the appropriate action.

NVIDIA also launched Isaac Tools, which GR00T utilizes. This includes Isaac Lab, a robot learning application that trains GR00T on Omniverse Isaac Sim, and OSMO, responsible for coordinating data generation, model training, and software/hardware-in-the-loop workflows across distributed environments. The robots can observe and imitate human movements, as impressively demonstrated in the keynote.

Project GR00T in Action

Conclusion and Future Implications

NVIDIA’s recent announcements at GTC 2024 significantly impact the future of AI. The Blackwell platform’s exceptional performance and efficiency will accelerate the development and deployment of generative AI models. This breakthrough could lead to significant advancements in various fields, such as healthcare and entertainment. Moreover, NVIDIA’s Inference Microservice and AI Foundry collaborations are expected to democratize access to AI, making it easier and more efficient for developers to deploy AI solutions. As a result, we should witness a surge of AI-driven applications across various industries.

Project GR00T is set to revolutionize the field of robotics. Shortly, we could see more sophisticated robots capable of understanding natural language and emulating human actions. The Omniverse platform enhancements could transform how we design and simulate 3D workflows, which could have significant implications for the gaming, architecture, and manufacturing industries.

It will be fascinating to see how these advancements shape the landscape of AI in the coming years. However, many experts in the field of AI are concerned about the potential adverse effects this technology might have on humanity. With every significant milestone in the advancement of AI, our livelihood is at greater risk, given the impact the technology can have on our society. Nevertheless, I remain optimistic and hopeful that this will not mark the beginning of the end, as some people describe it. Ultimately, unless you’re at the forefront of AI advancement, all you can do is hope.

--

--