NEMOTRON, Nvidia’s New ChatGPT-Level Model

The King of Reward

13 min read · Jun 24, 2024

NVIDIA, apparently not content with being the greatest story ever told in the public markets (it even held the number one spot by market capitalization for a couple of days), has released a new model, Nemotron-4 340B, that beats GPT-4o (and any other model that dares to compare) in some specific areas.

The release also comes with fascinating findings, such as the fact that these models:

excel at synthetic data generation (allowing users to generate specialized data to train their own models),
include a new state-of-the-art reward model coupled with an exciting, brand-new alignment method,
and, crucially, show proof that weaker AIs can train stronger AIs, a counterintuitive yet critical requirement for safety training in humanity’s quest to steer more-powerful-than-us models in the near future.

On top of that, NVIDIA has released this model as a fully open-source project, giving the industry a wealth of valuable knowledge.

Here’s all you need to know about it.

You are probably sick of AI newsletters talking about how this or that **just** happened. These newsletters abound because superficially recounting events that have already taken place is easy, but the value provided…

Written by Ignacio de Gregorio