NEMOTRON, Nvidia’s New ChatGPT-Level Model

The King of Reward

Ignacio de Gregorio
13 min readJun 24, 2024
Generated by author using GPT-4o

NVIDIA, not having enough of being the greatest story ever told in the public markets (even reaching the number one spot for a couple of days), has released a new model, Nemotron-340B, that beats GPT-4o (and any other model that dares to compare) in some specific areas.

Moreover, this release also includes fascinating information, such as the fact that these models:

  • excel at synthetic data generation (allowing users to generate specialized data to train their models),
  • represent a new state-of-the-art reward model coupled with an exciting and brand-new alignment method,
  • and, crucially, they show proof that weaker AIs can train stronger AIs, a counterintuitive yet critical requirement for safety training in humanity’s quest to steer more-powerful-than-us models in the near future.

Moreover, NVIDIA has released this model as a fully open-source project, providing the industry with a depth of invaluable knowledge.

Here’s all you need to know about it.

You are probably sick of AI newsletters talking about how this or that **just** happened. These newsletters abound because coarsely talking about events and things that already took place is easy, but the value provided

--

--