ChatGPT-4o, OpenAI’s New Flagship Model’s Full Review

Fast and Truly Multimodal

Ignacio de Gregorio
11 min read · May 14, 2024
Generated by author using ChatGPT-4o

After a year, we finally have a new model from OpenAI, the latest version of their transformer family, GPT-4o (“omnimodal”).

It’s absurdly fast in text, audio, image and video processing, and image generation. It also shows stark coding and multimodal reasoning improvements while enabling new modalities like 3D rendering.

What’s more, according to lmsys.org’s Chatbot Arena, it’s already the best all-around model, judging by the results of its proxy model, the famous gpt2-chatbot we discussed two weeks ago.

But this time, the reason behind the release isn’t about “pushing the veil of ignorance forward,” in Sam Altman’s words, but about putting state-of-the-art AI in the hands of billions for free.

Here’s all you need to know about ChatGPT-4o.

You are probably sick of AI newsletters talking about how this or that **just** happened. These newsletters abound because superficially covering events that have already taken place is easy, but the value they provide is limited and the hype is exaggerated.

However, newsletters talking about what will happen are a rare sight. If you’re into easy-to-understand insights looking into the future of AI before anyone…

Ignacio de Gregorio

I break down frontier AI systems in easy-to-understand language for you. Sign up to my newsletter here: https://thetechoasis.beehiiv.com/subscribe