CC Capabilities

Capabilities / Benchmarks

Partnering with Mozilla to improve Firefox’s security

Anthropic Software engineering score 76/100 confidence 0.88
Category
Benchmarks
Capability
Cyber defence and misuse monitoring
Observed
2026-03-06
Thesis section
Appendix III, section one: model and benchmark capability evidence

Claim

AI models can now independently identify high-severity vulnerabilities in complex software. As we recently documented, Claude found more than 500 zero-day vulnerabilities (security flaws that are unknown to the software’s maintainers) in well-tested open-source software. In this post, we share details of a collaboration with researchers at Mozilla in which.

Oracle verdict

This belongs in the register because benchmark and model-release claims set the ceiling for the next wave of deployment stories. The labour-market effect is indirect today, but it becomes direct when these gains are packaged into agents, APIs, and enterprise tools.

Why it matters

Imported from the official Anthropic release stream because it was published on or after the GPT-5 launch date (2025-08-07).