CC Capabilities

Capabilities / Benchmarks

GPT-5.1-Codex-Max System Card

OpenAI Software engineering score 86/100 confidence 0.9
Category
Benchmarks
Capability
Frontier model release and benchmark movement
Observed
2025-11-19
Thesis section
Appendix III, section one: model and benchmark capability evidence

Claim

This system card outlines the comprehensive safety measures implemented for GPT‑5.1-CodexMax. It details both model-level mitigations, such as specialized safety training for harmful tasks and prompt injections, and product-level mitigations like agent sandboxing and configurable network access.

Oracle verdict

OpenAI is describing a frontier or production capability that pushes directly on the thesis. The important signal is not the marketing language; it is the widening set of tasks now being routed through model-driven execution rather than ordinary software or headcount.

Why it matters

Imported from the official OpenAI release stream because it was published on or after the GPT-5 launch date (2025-08-07).