CC Capabilities

Capabilities / Benchmarks

Measuring political bias in Claude

Anthropic General AI capability score 86/100 confidence 0.88
Category
Benchmarks
Capability
Model and benchmark capability movement
Observed
2025-11-13
Thesis section
Appendix III, section one: model and benchmark capability evidence

Claim

We want Claude to be seen as fair and trustworthy by people across the political spectrum, and to be unbiased and even-handed in its approach to political topics. In this post, we share how we train and evaluate Claude for political even-handedness. We also report the results of a new, automated, open-source evaluation for political neutrality that we’ve.

Oracle verdict

Anthropic is describing a frontier or production capability that pushes directly on the thesis. The important signal is not the marketing language; it is the widening set of tasks now being routed through model-driven execution rather than ordinary software or headcount.

Why it matters

Imported from the official Anthropic release stream because it was published on or after the GPT-5 launch date (2025-08-07).