Anthropic’s Claude 3 Beats GPT-4 Across Main Metrics

3 min readMar 4, 2024

The race is on, and it should not be over anytime soon.

Every week, even every day, new breakthroughs are happening in the AI space, and here is a new one.

Anthropic just released (On February 4th, 2024), its Claude 3 series, a new family of models: Opus, Sonnet, and Haiku.

The models are available on test, on Claude.ai but also via API.

So what is new about this new release, on how does it outperform OpenAI ChatGPT4?

Performance

The benchmark made with Claude 3 (Opus) shows better accuracy against GPT4 on Undergraduate level knowledge (86,8% vs 86,4%), Graduate level reasoning (50,4% vs 35,7%), Grade school math (95% vs 92%), Math problem solving (60,1% vs 52,9%), Multilingual Math (90,7% vs 74,5%), Code (84,9% vs 67%), Reasoning over text (83,1% vs 80,9%) and so on.

It also beats Gemini 1.0 Ultra on the same benchmarks.

Here is the full benchmark matrix shared by Anthropic.

Claude 3 benchmark vs main LLMs like GPT4 and Gemini — Table from Anthropic

Capabilities

The Claude 3 models offer multimodal capabilities, enabling them to understand both text and visual inputs. This feature is crucial for analyzing and processing complex, unstructured information in a variety of formats. It…

Anthropic’s Claude 3 Beats GPT-4 Across Main Metrics

Performance

Capabilities

Written by Ahmed Fessi