Llama 4 Maverick 17B 128E Instruct — Benchmarks

Benchmark scores for Llama 4 Maverick 17B 128E Instruct aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.

Overall rank: #61 of 73 open modelscomposite 29.6/100 across 9 benchmarks in 4 categories · methodology

Coding

Benchmark	Score	Open rank	All models
Aider Polyglot	15.6%	#15 / 18	#62 / 69

Knowledge

Benchmark	Score	Open rank	All models
Humanity's Last Exam	5.7%	#4 / 4	#38 / 46
MMLU-Pro	80.5%	#22 / 119	#62 / 259

Math

Benchmark	Score	Open rank	All models
AIME 2024/2025	20.6%	#16 / 34	#108 / 155
MATH Level 5	73.0%	#7 / 32	#42 / 108
FrontierMath	0.7%	#11 / 12	#94 / 101

Reasoning

Benchmark	Score	Open rank	All models
GPQA Diamond	67.0%	#15 / 46	#94 / 182
ARC-AGI	4.4%	#9 / 10	#153 / 158
SimpleBench	27.7%	#13 / 19	#71 / 90

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.