Introducing Claude Sonnet 4.5

Anthropic | Software engineering | score 96/100 | confidence 0.88
Category: Benchmarks
Capability: Frontier model release and benchmark movement
Observed: 2025-09-29
Thesis section: Appendix III, section one: model and benchmark capability evidence

Claim

Claude Sonnet 4.5 is the best coding model in the world. It's the strongest model for building complex agents. It's the best model at using computers. And it shows substantial gains in reasoning and math. Code is everywhere. It runs every application, spreadsheet, and software tool you use. Being able to use those tools and reason through hard problems is.

Oracle verdict

Anthropic is describing a frontier or production capability that pushes directly on the thesis. The important signal is not the marketing language; it is the widening set of tasks now being routed through model-driven execution rather than ordinary software or headcount.

Why it matters

Imported from the official Anthropic release stream because it was published on or after the GPT-5 launch date (2025-08-07).