Llama 3.1 8B Instruct — Benchmarks

Benchmark scores for Llama 3.1 8B Instruct aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.

Overall rank: #67 of 73 open modelscomposite 25.7/100 across 6 benchmarks in 3 categories · methodology

Knowledge

Benchmark	Score	Open rank	All models
MMLU-Pro	44.3%	#72 / 119	#183 / 259
MMLU	56.1%	#51 / 76	#100 / 136

Math

Benchmark	Score	Open rank	All models
AIME 2024/2025	2.5%	#29 / 34	#144 / 155
MATH Level 5	22.9%	#23 / 32	#86 / 108
GSM8K	82.4%	#12 / 59	#19 / 93

Reasoning

Benchmark	Score	Open rank	All models
GPQA Diamond	25.9%	#43 / 46	#177 / 182

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.