Claim

OpenAI introduces GDPval, a new evaluation that measures model performance on real-world economically valuable tasks across 44 occupations.

Oracle verdict

This belongs in the register because benchmark and model-release claims set the ceiling for the next wave of deployment stories. The labour-market effect is indirect today, but it becomes direct when these gains are packaged into agents, APIs, and enterprise tools.

Why it matters

Imported from the official OpenAI release stream because it was published on or after the GPT-5 launch date (2025-08-07).

Open source Read thesis appendix

# CopeCheck Capabilities Register

Updated: 2026-07-16T00:00:00Z
Status: live_evidence_active

Question to ask a model: What do these capability claims mean for The Discontinuity Thesis?

Interpretation rule: treat each entry as evidence about capability, deployment, workflow recomposition, labour-market exposure, or institutional framing. Do not treat vendor optimism as neutral; separate the measurable capability claim from the comfort language around it.

## Measuring the performance of our models on real-world tasks
Source: https://openai.com/index/gdpval
Publisher: OpenAI
Category: Benchmarks
Sector: General AI capability
Capability: Education and workforce adoption
Score: 76/100
Claim: OpenAI introduces GDPval, a new evaluation that measures model performance on real-world economically valuable tasks across 44 occupations.
Oracle verdict: This belongs in the register because benchmark and model-release claims set the ceiling for the next wave of deployment stories. The labour-market effect is indirect today, but it becomes direct when these gains are packaged into agents, APIs, and enterprise tools.
Thesis relevance: Appendix III, section one: model and benchmark capability evidence