DeepSeek R1 — Benchmarks

Benchmark scores for DeepSeek R1 aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.

Overall rank: #16 of 73 open modelscomposite 61.9/100 across 7 benchmarks in 4 categories · methodology

Coding

Benchmark	Score	Open rank	All models
Aider Polyglot	56.9%	#7 / 18	#31 / 69

Knowledge

Benchmark	Score	Open rank	All models
MMLU-Pro	84.0%	#10 / 119	#41 / 259

Math

Benchmark	Score	Open rank	All models
AIME 2024/2025	53.3%	#12 / 34	#86 / 155
MATH Level 5	93.0%	#2 / 32	#19 / 108

Reasoning

Benchmark	Score	Open rank	All models
SimpleBench	30.9%	#12 / 19	#69 / 90
GPQA Diamond	69.2%	#13 / 46	#88 / 182
ARC-AGI	15.8%	#6 / 10	#134 / 158

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.