Measuring the performance of our models on real-world tasks

OpenAI, General AI, capability score 76/100, confidence 0.9
Category: Benchmarks
Capability: Education and workforce adoption
Observed: 2025-09-25
Thesis section: Appendix III, section one: model and benchmark capability evidence

Claim

OpenAI introduces GDPval, a new evaluation that measures model performance on real-world economically valuable tasks across 44 occupations.

Oracle verdict

This belongs in the register because benchmark and model-release claims set the ceiling for the next wave of deployment stories. The labour-market effect is indirect today, but it becomes direct when these gains are packaged into agents, APIs, and enterprise tools.

Why it matters

Imported from the official OpenAI release stream because it was published on or after the GPT-5 launch date (2025-08-07).