Model Card Explorer

Summary

Mistral Small 24B Instruct 2501 Model Card

A 371-word brief of a 1,740-word document. Published by Mistral AI. Version dated Apr 17, 2026.

What this is

Mistral-Small-24B-Instruct-2501, marketed as "Mistral Small 3," is a 24-billion-parameter instruction-fine-tuned language model released by Mistral AI in January 2025. It is built on the Mistral-Small-24B-Base-2501 base model and positioned as state-of-the-art in the sub-70B category. Its stated design goals are local deployment, fast conversational response, low-latency function calling, and fine-tuning for domain experts.

Capabilities

The model scores 0.663 on MMLU Pro 5-shot CoT, 0.453 on GPQA Main 5-shot CoT, 0.848 on HumanEval pass@1, and 0.706 on MATH instruct. Instruction-following benchmarks show 8.35 on MT-Bench, 52.27 on WildBench, and 0.873 on Arena Hard. The model supports a 32k context window, eleven or more languages including English, French, German, Spanish, Italian, Chinese, Japanese, and Korean, and provides native function calling and JSON output via a Tekken tokenizer with a 131k vocabulary.

Evaluation methodology

Public benchmarks were run through a single internal evaluation pipeline; Mistral notes that numbers "may vary slightly from previously reported performance" for comparison models. Judge-based evals — WildBench, Arena Hard, and MT-Bench — used GPT-4o-2024-05-13 as the judge. Human preference evaluations were conducted side-by-side with an external third-party vendor on over 1,000 proprietary coding and generalist prompts, with model identity anonymized; Mistral states it took "extra caution in verifying a fair evaluation" and is "confident that the above benchmarks are valid."

Safety testing

Not disclosed in this document.

Mitigations

Not disclosed in this document.

Deployment and access

The model is released under the Apache 2.0 license, permitting commercial and non-commercial use and modification. It is available via vLLM (requiring approximately 55 GB GPU RAM in bf16 or fp16), Hugging Face Transformers, and Ollama for local inference. Quantized versions (4-bit, 8-bit) can run on a single RTX 4090 or a 32 GB RAM MacBook. The card notes that enterprises needing extended context, specific modalities, or domain knowledge will be served by separate commercial models beyond this open release.

Limitations

The model's knowledge base has a cutoff of October 1, 2023, as stated in the recommended system prompt. The card includes no explicit limitations section and does not discuss failure modes, bias evaluations, or out-of-distribution behavior.

What's new

This is the January 2025 release of Mistral Small, designated "Mistral Small 3 (2501)," superseding prior Mistral Small releases. No version changelog or explicit list of changes from a prior version is included in the card.

Benchmark	Category	State	Score	Setup	Source
	coding	scored	0.8% pass at 1	instruction-tunedmissing: shot countmissing: methodmissing: language	self-reported
	instruction_following	scored	0.8% accuracy	instruction-tunedmissing: shot countmissing: methodmissing: language	self-reported
/ pro	knowledge	scored	0.7% accuracy	5-shotcotinstruction-tunedmissing: language	self-reported
	math	scored	0.7% accuracy	instruction-tunedmissing: shot countmissing: methodmissing: language	self-reported
	other	scored	52.3 win rate	instruction-tunedmissing: shot countmissing: methodmissing: language	self-reported
	other	scored	8.3 accuracy	instruction-tunedmissing: shot countmissing: methodmissing: language	self-reported
	other	scored	0.9 win rate	instruction-tunedmissing: shot countmissing: methodmissing: language	self-reported
/ main	reasoning	scored	0.5% accuracy	5-shotcotinstruction-tunedmissing: language	self-reported

Mistral Small 24B Instruct 2501 Model Card

Mistral Small 24B Instruct 2501 Model Card

What this is

Capabilities

Evaluation methodology

Safety testing

Mitigations

Deployment and access

Limitations

What's new

Extracted Evaluations(8 results)