Claude 3.5 Sonnet — Better than GPT4o ⭐

Multilingual Evaluation vs. GPT-4o and Gemini 1.5

Lars Wiik
4 min readJun 20, 2024

Anthropic has been at the forefront of the Large Language Model (LLM) race for the past year, consistently pushing the boundaries of what’s possible in AI.

Following the remarkable success of Claude 3 Opus, which has outperformed state-of-the-art models from OpenAI and Google in several benchmarks, Anthropic has now introduced Claude 3.5 Sonnet!

But is Claude 3.5 Sonnet multilingual?

In this article, I analyze the multilingual performance of Claude 3.5 Sonnet against OpenAI’s GPT-4o, Google’s Gemini 1.5, as well as Anthropic’s previous flagship model Claude 3 Opus.

I will present how each LLM performs in various languages such as Spanish, German, French, Portuguese, and Russian, as well as more niche languages.

Illustration of Claude 3.5’s performance and cost [source]

What’s new with Claude 3.5 Sonnet?

Claude 3.5 Sonnet is twice as fast as Claude Opus, incorporates enhanced reasoning capabilities compared with previous models, and showcases state-of-the-art vision capabilities.

--

--

Lars Wiik

MSc in AI — LLM Engineer ⭐ — Curious Thinker and Constant Learner