Llama 3.1 70B Instruct — Benchmarks

Benchmark scores for Llama 3.1 70B Instruct aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.

Overall rank: #52 of 73 open modelscomposite 37.3/100 across 5 benchmarks in 3 categories · methodology

Knowledge

Benchmark	Score	Open rank	All models
MMLU-Pro	62.8%	#48 / 119	#130 / 259
MMLU	80.1%	#10 / 76	#24 / 136

Math

Benchmark	Score	Open rank	All models
AIME 2024/2025	3.6%	#27 / 34	#140 / 155
MATH Level 5	36.7%	#19 / 32	#77 / 108

Reasoning

Benchmark	Score	Open rank	All models
GPQA Diamond	44.2%	#28 / 46	#139 / 182

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.