Users Will Fall in Love With OpenAI’s New GPT-4o Model. Literally.

The company’s new GPT-4o can understand and mimic human speech and emotion

Thomas Smith
The Generator

--

Screenshot via OpenAI demo, by the author

In the iconic 2013 film Her, the protagonist develops an intense relationship — which morphs into a love affair — with a voice-enabled AI system.

The AI in Her is everything that today’s voice-enabled systems are not: emotive, funny, and able to intuit the subtleties of human conversation.

In a major announcement this morning, OpenAI announced the release of a new version of its ChatGPT system that natively integrates speech, transcription, and intelligence into a single model.

It’s powerful, intuitive, and disturbingly human-like. Essentially, OpenAI has built a real-life version of Her.

A Bad Conversationalist

ChatGPT has had voice capabilities for months now. Even today, you can open the ChatGPT app on your phone, press the headphones icon, and converse with the system using your voice.

The problem, though, was that ChatGPT was a terrible conversationalist.

Essentially, ChatGPT’s voice capabilities were a hack created by splicing together three different models.

--

--