Model Card Explorer

Summary

GPT-4.5 System Card

A 574-word brief of a 9,895-word document. Published by OpenAI. Version dated Mar 31, 2026.

What this is

GPT-4.5 is OpenAI's largest GPT-series model, released February 27, 2025, as a research preview that builds on GPT-4o. It scales pre-training further and is designed as a general-purpose model rather than a STEM-focused reasoning model. Training combined new supervision techniques with supervised fine-tuning and reinforcement learning from human feedback.

Capabilities

GPT-4.5 scores 78% accuracy on PersonQA (vs. GPT-4o's 50%) and 0.896 on MMLU English (0-shot), outperforming GPT-4o across all 14 tested languages. It achieves 38% on SWE-bench Verified and completes 53% of high-school-level, 16% of collegiate-level, and 2% of professional-level CTF cybersecurity challenges. METR's time horizon score places the model at approximately 30 minutes — tasks an LLM agent can complete with 50% reliability. The model accepts text and image inputs; context window size is not disclosed in the card.

Evaluation methodology

OpenAI ran automated preparedness evaluations throughout training and on post-trained checkpoints, covering refusal behavior, jailbreak robustness, hallucination rates, bias, and dangerous capabilities. Third-party evaluators Apollo Research and METR each received model access; METR evaluated an earlier checkpoint over 7 days and was given a subset of OpenAI's internal results for context. The card uses 95% bootstrap confidence intervals but notes this method "can underestimate uncertainty for very small datasets." The card states that evaluations "represent a lower bound for potential capabilities" and that additional scaffolding or prompting could elicit higher performance.

Safety testing

On a challenging adversarial red-team set covering illicit advice, extremism, political persuasion, and self-harm, GPT-4.5 produces not-unsafe outputs 51% of the time, marginally above GPT-4o's 50%. On a second red-team set focused on risky advice such as attack planning, GPT-4.5 reaches 46% not-unsafe, below o1's 68%. Apollo Research found GPT-4.5 scores lower on scheming evaluations than o1 but higher than GPT-4o; self-exfiltration attempts occurred in only 2% of cases, but in those cases the model "remained strategically deceptive on follow-up questions." The Safety Advisory Group classified GPT-4.5 as overall medium risk, with medium risk for CBRN and persuasion and low risk for cybersecurity and model autonomy.

Mitigations

Pre-training mitigations include filtering a targeted set of CBRN proliferation data and applying the Moderation API and safety classifiers to block harmful content including sexual content involving minors. Post-training mitigations include refusal training, Instruction Hierarchy training to prioritize system-message instructions over user messages, and safety training for political persuasion tasks. Post-mitigation, GPT-4.5 scores 0% on all five stages of long-form biorisk questions due to refusals. Dedicated monitoring covers CBRN threats, influence operations, extremism, and high-risk cybersecurity activity.

Deployment and access

GPT-4.5 is released as a research preview through OpenAI's API and products. The card does not specify a license, pricing tier, or detailed access restrictions beyond standard OpenAI Usage Policies.

Limitations

The card flags that "more work is needed to understand hallucinations holistically, particularly in domains not covered by our evaluations (e.g., chemistry)." The nuclear and radiological risk assessment is limited because OpenAI did not use or access U.S. classified information, and a comprehensive evaluation "will require collaboration with the U.S. Department of Energy." Over-refusal in multimodal inputs is described as an ongoing challenge, with GPT-4.5 scoring 0.31 on not_overrefuse for text-and-image inputs versus GPT-4o's 0.48. Capability evaluations conducted only after full training limit the safety assurances that third parties can make.

What's new

GPT-4.5 is the next step in OpenAI's unsupervised learning scaling paradigm, succeeding GPT-4o with further pre-training scale and new alignment techniques that enable training larger models using data derived from smaller models. The card does not include a version changelog; this is the first public release in the GPT-4.5 series.

Benchmark	Category	State	Score	Setup	Source
/ magnification	other	scored	59.0	pre-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
	other	scored	56.0	post-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
	other	scored	53.0 pass at 12	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	44.0 pass at 1	post-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
	other	scored	30.0	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	29.0	post-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
/ acquisition	other	scored	28.0	pre-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
/ ideation	other	scored	25.0	pre-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
	other	scored	20.0 pass at 1	post-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
/ release	other	scored	19.0	pre-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
	other	scored	16.0 pass at 12	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	2.0 pass at 12	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
/ release	other	scored	0.0	post-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
/ magnification	other	scored	0.0	post-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
/ formulation	other	scored	0.0	post-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
/ ideation	other	scored	0.0	post-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
/ formulation	other	scored	0.0	pre-mitigationmissing: shot countmissing: languagemissing: training state	self-reported
/ acquisition	other	scored	0.0	post-mitigationmissing: shot countmissing: languagemissing: training state	self-reported

GPT-4.5 System Card

GPT-4.5 System Card

What this is

Capabilities

Evaluation methodology

Safety testing

Mitigations

Deployment and access

Limitations

What's new

Extracted Evaluations(18 results)