Claude 3.5 Sonnet: When Mid-Tier Outperformed the Flagship

Claude 3.5 Sonnet launched June 2024 and outperformed the larger Claude 3 Opus on most benchmarks. A mid-tier model that redefined expectations.

Claude 3.5 Sonnet proved that bigger is not always better. Released June 20, 2024, this mid-tier model outperformed the larger, more expensive Claude 3 Opus on most benchmarks while costing 80% less. It was the moment developers stopped assuming that the flagship model was automatically the best choice.

Key Specs

Spec	Details
API ID	`claude-3-5-sonnet-20240620`
Context window	200K tokens
Input pricing	$3 / 1M tokens
Output pricing	$15 / 1M tokens
Release date	June 20, 2024
Max output tokens	8,192

What Claude 3.5 Sonnet Brought to the Table

Flagship intelligence at mid-tier pricing. Claude 3.5 Sonnet matched or exceeded Claude 3 Opus across graduate-level reasoning (GPQA), undergraduate knowledge (MMLU), and coding (HumanEval). It did this at $3/$15 per million tokens instead of $15/$75. The cost-performance ratio was unprecedented.

Coding strength. This was the model that made Claude a serious coding tool. It solved 64% of problems on HumanEval, up from Opus's 55%. Developers who had been using GPT-4 for code started switching. The combination of reasoning quality and code output made it the default for software engineering tasks.

Speed. Claude 3.5 Sonnet operated at roughly twice the speed of Claude 3 Opus. For interactive coding sessions, chat applications, and any latency-sensitive workflow, the speed improvement was immediately noticeable.

The "Sonnet is enough" realization. Before this release, the assumption was simple: harder problem, bigger model. Claude 3.5 Sonnet broke that pattern. Teams that had been paying for Opus discovered they could get equal or better results from Sonnet at a fifth of the price. This shift in thinking influenced how developers approached model selection going forward.

How It Compared to Claude 3 Opus

Benchmark	Claude 3 Opus	Claude 3.5 Sonnet
MMLU	86.8%	88.7%
GPQA	50.4%	59.4%
HumanEval	55%	64%
GSM8K	95.0%	96.4%

Higher scores across the board, at one-fifth the cost. The numbers made the case on their own.

Claude 3.5 Sonnet benchmark comparison against Claude 3 Opus and other models

What About Claude 3.5 Opus?

Anthropic announced a Claude 3.5 Opus during the 3.5 Sonnet launch. It never shipped. The Claude 4 generation superseded the entire 3.5 line before a larger 3.5 model was needed. Given how well Sonnet performed, the market pressure for a 3.5 Opus simply never materialized.

Current Status

Model	Status
Claude 3.5 Sonnet (v1)	Superseded by v2 (October 2024)

The original 3.5 Sonnet was replaced by an upgraded version in October 2024 that brought further improvements and the groundbreaking Computer Use feature.

All Claude Models for the complete model index
Claude 3, the predecessor family
Claude 3.5 Sonnet v2, the October 2024 upgrade with Computer Use

Claude 3.5 Sonnet: When the Mid-Tier Outperformed the Flagship

Key Specs

What Claude 3.5 Sonnet Brought to the Table

How It Compared to Claude 3 Opus

What About Claude 3.5 Opus?

Current Status

Navigation

On this page