Anthropic’s Claude 3 Beats GPT-4 Across Main Metrics
The race is on, and it should not be over anytime soon.
Every week, even every day, new breakthroughs are happening in the AI space, and here is a new one.
Anthropic just released (On February 4th, 2024), its Claude 3 series, a new family of models: Opus, Sonnet, and Haiku.
The models are available on test, on Claude.ai but also via API.
So what is new about this new release, on how does it outperform OpenAI ChatGPT4?
Performance
The benchmark made with Claude 3 (Opus) shows better accuracy against GPT4 on Undergraduate level knowledge (86,8% vs 86,4%), Graduate level reasoning (50,4% vs 35,7%), Grade school math (95% vs 92%), Math problem solving (60,1% vs 52,9%), Multilingual Math (90,7% vs 74,5%), Code (84,9% vs 67%), Reasoning over text (83,1% vs 80,9%) and so on.
It also beats Gemini 1.0 Ultra on the same benchmarks.
Here is the full benchmark matrix shared by Anthropic.
Capabilities
The Claude 3 models offer multimodal capabilities, enabling them to understand both text and visual inputs. This feature is crucial for analyzing and processing complex, unstructured information in a variety of formats. It…