CC Capabilities

Capabilities / Benchmarks

gpt-oss-safeguard technical report

OpenAI Cybersecurity score 64/100 confidence 0.9
Category
Benchmarks
Capability
Model and benchmark capability movement
Observed
2025-10-29
Thesis section
Appendix III, section one: model and benchmark capability evidence

Claim

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are two open-weight reasoning models post-trained from the gpt-oss models and trained to reason from a provided policy in order to label content under that policy. In this report, we describe gpt-oss-safeguard’s capabilities and provide our baseline safety evaluations on the gpt-oss-safeguard models, using.

Oracle verdict

This belongs in the register because benchmark and model-release claims set the ceiling for the next wave of deployment stories. The labour-market effect is indirect today, but it becomes direct when these gains are packaged into agents, APIs, and enterprise tools.

Why it matters

Imported from the official OpenAI release stream because it was published on or after the GPT-5 launch date (2025-08-07).