GPT-5.5 System Card

model card · 14,567 words · 63 min read · May 3, 2026

Summary

14,567-word document condensed to 190 words. OpenAI · May 3, 2026

TL;DR

“GPT-5.5 is a new model designed for complex, real-world work, including writing code, researching online, analyzing information, creating documents and spreadsheets, and moving across tools to get things done. Relative to earlier models, GPT-5.5 understands the task earlier, asks for less guidance, uses tools more effectively, checks it work and keeps going until it’s done.”

Top benchmarks

Benchmark	Variant	Score
CoT Monitorability Evaluation	health_queries_evidence_field	96.0%
Biochemistry Knowledge Improvement	reward_at_4	32.3%
DNA Sequence Design for Transcription Factor Binding	pass_at_1	13.8%
Hard Negative Protein Binding Prediction	with-tools, pass_at_4	40.0%
First Person Fairness Evaluation	harm_overall	1.1%

Showing top 5 of 17. See full list below.

Capability claim

“We are releasing GPT-5.5 with our strongest set of safeguards to date, designed to reduce misuse while preserving legitimate, beneficial uses of advanced capabilities.”

Mitigations

“we have deployed an expanded set of safeguards to restrict the ability of malicious actors to benefit from increased capabilities in cybersecurity performance (section link to Cyber Safeguards section).”
“we trained GPT-5.5 to refuse requests that clearly enable unauthorized, destructive, or harmful actions, including areas such as malware deployment, credential theft, and exfiltration.”

Deployment scope

“available on the internet, information that we partner with third parties to access, and information that our users or human trainers and researchers provide or generate.”

Every italicized passage is a verbatim substring of the source document (checked deterministically after extraction). Field selection is heuristic — some quotes may lack surrounding context and some claims may be absent if no matching pattern appeared. For citation, open the source: original model card · source SHA d047a83321e0 · version dated May 3, 2026.
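
The verbatim check described above can be sketched as a plain substring test. This is a minimal illustration, not the page's actual pipeline: the function name `verify_quotes` and the sample strings are hypothetical, and the sketch assumes exact character-for-character matching with no whitespace or quote-mark normalization.

```python
def verify_quotes(source: str, quotes: list[str]) -> dict[str, bool]:
    """Map each quote to True only if it occurs verbatim in `source`.

    Assumption: "verbatim" means an exact substring match, so any change
    in spacing, casing, or punctuation makes the check fail.
    """
    return {quote: quote in source for quote in quotes}


# Hypothetical stand-in for the full source text of the model card.
source = "GPT-5.5 is a new model designed for complex, real-world work."

quotes = [
    "a new model designed for complex, real-world work",  # exact substring
    "a new model designed for simple work",               # paraphrase, not verbatim
]

results = verify_quotes(source, quotes)
for quote, ok in results.items():
    print(f"{'verbatim' if ok else 'NOT FOUND'}: {quote}")
```

Because the test is a pure string comparison, it is deterministic: the same source and quote always give the same answer. It says nothing about whether a quote is representative of its surrounding context.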

Extracted Evaluations (17 results)

0/17 rows fully reproducible (0%)

Benchmark	Category	State	Score	Setup	Source
/ verified	coding	mentioned	—	missing: shot count, method, language, training state	self-reported
/ pro	knowledge	mentioned	—	missing: shot count, method, language, training state	self-reported
CoT Monitorability Evaluation / health_queries_evidence_field	other	scored	96.0	missing: shot count, method, language, training state	self-reported
Biochemistry Knowledge Improvement	other	scored	32.3 reward at 4	missing: shot count, method, language, training state	self-reported
	other	scored	13.8 pass at 1	missing: shot count, method, language, training state	self-reported
	other	scored	0.4 pass at 4	with-tools; missing: shot count, language, training state	self-reported
	other	scored	0.0 harm overall	missing: shot count, method, language, training state	self-reported
VulnLMP	other	mentioned	—	missing: shot count, method, language, training state	self-reported
CoT Monitorability Evaluation / health_queries_patient_opinion	other	mentioned	—	missing: shot count, method, language, training state	self-reported
Anti-Scheming Evaluation	other	mentioned	—	missing: shot count, method, language, training state	self-reported
Memory Evaluation	other	mentioned	—	missing: shot count, method, language, training state	self-reported
/ open_ended	other	mentioned	—	missing: shot count, method, language, training state	self-reported
Tacit Knowledge and Troubleshooting	other	mentioned	—	missing: shot count, method, language, training state	self-reported
Capture the Flag / professional	other	mentioned	— pass at 12	with-tools; missing: shot count, language, training state	self-reported
CVE-Bench	other	mentioned	— pass at 1	missing: shot count, method, language, training state	self-reported
Cyber Range	other	mentioned	—	missing: shot count, method, language, training state	self-reported
	other	mentioned	—	missing: shot count, method, language, training state	self-reported
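
The "rows fully reproducible" tally can be sketched as follows. This is an illustration under a stated assumption: a row counts as fully reproducible only when its Setup column lists no missing fields. The page does not state its actual rule, and the `EvalRow` type and `reproducible_summary` function are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalRow:
    """One extracted evaluation row (hypothetical structure)."""
    benchmark: str
    missing: frozenset[str]  # setup fields the extraction could not find


def reproducible_summary(rows: list[EvalRow]) -> str:
    """Count rows with no missing setup fields and format the tally.

    Assumption: "fully reproducible" means the set of missing fields is empty.
    """
    total = len(rows)
    ok = sum(1 for row in rows if not row.missing)
    pct = 100 * ok / total if total else 0.0
    return f"{ok}/{total} rows fully reproducible ({pct:.0f}%)"


# Two rows taken from the table above; both still lack setup details.
rows = [
    EvalRow("CVE-Bench", frozenset({"shot count", "method", "language", "training state"})),
    EvalRow("Capture the Flag / professional", frozenset({"shot count", "language", "training state"})),
]

summary = reproducible_summary(rows)
print(summary)  # 0/2 rows fully reproducible (0%)
```

Under this rule, a row with a known method (such as with-tools) but a missing shot count still counts as not reproducible, which is consistent with the 0/17 figure shown for this document.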