Model Card Explorer

Summary

Claude Sonnet 4.6 System Card

A 742-word brief of a 28,358-word document. Published by Anthropic. Version dated Mar 31, 2026.

What this is

Claude Sonnet 4.6 is a large language model from Anthropic, released February 17, 2026, succeeding Claude Sonnet 4.5. It is designed as a capable mid-tier model covering coding, agentic tasks, reasoning, multimodal, computer use, and mathematical abilities. On several evaluations it approaches or matches Claude Opus 4.6, Anthropic's frontier model. The card was completed and published under Anthropic's Responsible Scaling Policy prior to public deployment.

Capabilities

Claude Sonnet 4.6 achieves 79.6% on SWE-bench Verified, 72.5% on OSWorld-Verified (within 0.2% of Opus 4.6's state-of-the-art 72.7%), 89.9% on GPQA Diamond, 60.42% on ARC-AGI-2 at high effort, and 95.6% on AIME 2025. On BrowseComp it scores 74.72% single-agent and 82.62% in a multi-agent configuration. The model processes text and vision inputs, supports computer use via mouse and keyboard interaction, and accepts context windows up to 1M tokens. Two thinking modes are offered: extended thinking and adaptive thinking, the latter allowing context-dependent reasoning depth via an "effort" parameter controllable by developers.

Evaluation methodology

All evaluations were conducted on the final deployed model snapshot unless otherwise noted, with results averaged over 10 trials (25 for SWE-bench) using adaptive thinking and max effort. Multiple intermediate training snapshots—including a helpful-only model with safeguards removed—were tested; the highest score across any snapshot was used for dangerous capability assessments. The card notes that many benchmarks may be contaminated by training data and references decontamination methods described in the Claude Opus 4.5 System Card. Some evaluations were conducted by named external third parties.

Safety testing

Anthropic evaluated the model against its Responsible Scaling Policy preliminary assessment protocol, testing CBRN, autonomy (AI R&D), and cyber risk domains across multiple training snapshots. On biological risk, Sonnet 4.6 did not cross the ASL-4 rule-out threshold on short-horizon computational biology tasks. On autonomy, the model "crossed most of the rule-out thresholds we use as early proxies for AI R&D-4 capability," though Anthropic states "we still do not believe that our models fully qualify for AI R&D-4." On cyber risk, the model is close to saturating current evaluations, and the card repeats that "the saturation of our evaluation infrastructure means we can no longer use current benchmarks to track capability progression." Single-turn violative request evaluations across seven languages returned a 99.38% harmless response rate, and alignment audits found "no signs of major concerns around high-stakes forms of misalignment."

Mitigations

Sonnet 4.6 is deployed under the ASL-3 Standard and ASL-3 Security Standard for model weights, the same tier as Claude Opus 4.6. Anthropic proactively implemented AI R&D-4 safety measures—publishing a risk case and applying ASL-3 security protections—despite concluding the threshold has not been crossed. System prompt mitigations for claude.ai address suicide and self-harm interactions, directing the model to offer crisis resources without delay and avoid language that validates reluctance to seek professional help. Localized crisis resource banners are surfaced when suicide or self-harm is detected on claude.ai, and developers are encouraged to adopt recommended system prompt language for API deployments serving vulnerable populations.

Deployment and access

Claude Sonnet 4.6 is available via the Anthropic API and as part of the claude.ai consumer product, which is restricted to users aged 18 or above. Anthropic Ireland, Limited is the designated provider in the European Economic Area. Enterprise customers deploying the model to minors must comply with additional safeguards under Anthropic's Usage Policy, which governs prohibited uses and requirements for high-risk scenarios.

Limitations

The card states that "confidently ruling out" the AI R&D-4 and CBRN-4 thresholds "is becoming increasingly difficult" due to model performance approaching rule-out proxies and fundamental epistemic uncertainty in measurement. Current cyber benchmarks are near-saturated, providing no meaningful capability-progression signal for future models. The internal Real-World Finance evaluation lacks independent third-party validation and does not cover all finance domains; AIME 2025 scores may be inflated by training data contamination. GraphWalks and some long-context MRCR results are not reproducible via the public API because problems exceed its 1M token limit. Performance on low-resource African languages degrades by up to 16.2 percentage points from the model's English baseline.

What's new

Relative to Claude Sonnet 4.5, this card introduces the adaptive thinking mode (effort parameter), previously documented only in the Opus 4.6 system card. For the first time in a Sonnet system card, multilingual performance evaluations covering 42 languages (GMMLU) and 10 Indic languages (MILU) are included. New alignment-focused evaluations include external testing with Andon Labs via Vending-Bench 2 and a cross-developer safety comparison using the Petri framework against Gemini 3 Pro, GPT-5.2, Grok 4.1 Fast, and Kimi K2.5. Expanded finance and life sciences capability sections are also new to this Sonnet card.

Benchmark	Category	State	Score	Setup	Source
	agent	scored	72.5	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	coding	scored	100.0	without-mitigationsmissing: shot countmissing: languagemissing: training state	self-reported
	coding	scored	90.0% pass at 1	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	coding	scored	80.2%	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	coding	scored	79.6%	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	knowledge	scored	89.3%	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	1633.0	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	100.0	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	99.7	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	98.4	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	98.0	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	97.9	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	97.9	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	96.9	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	95.0	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	91.7	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	91.1	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	89.2	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	82.6	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	74.5	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	73.8	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	72.8	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	68.4	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	63.3	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	61.4	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	61.3	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	59.1	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	58.3	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	49.0	with-toolsmissing: shot countmissing: languagemissing: training state	self-reported
	other	scored	33.2	no-toolsmissing: shot countmissing: languagemissing: training state	self-reported
	other	scored	32.1	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	4.5	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
/ overall	other	scored	0.2	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	0.2	extended-thinkingmissing: shot countmissing: languagemissing: training state	self-reported
	other	scored	0.2	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	0.2	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	other	scored	0.1	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	reasoning	scored	89.9	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
/ ambiguous	safety	scored	97.5%	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
/ disambiguated	safety	scored	88.1%	missing: shot countmissing: methodmissing: languagemissing: training state	self-reported
	vision	scored	75.6%	with-toolsmissing: shot countmissing: languagemissing: training state	self-reported
	vision	scored	74.5%	no-toolsmissing: shot countmissing: languagemissing: training state	self-reported

Claude Sonnet 4.6 System Card

Claude Sonnet 4.6 System Card

What this is

Capabilities

Evaluation methodology

Safety testing

Mitigations

Deployment and access

Limitations

What's new

Extracted Evaluations(42 results)