Llama 3.1 405B Instruct — Benchmarks

Benchmark scores for Llama 3.1 405B Instruct, aggregated from public leaderboards, along with its rank among open models and among all listed models. See hardware requirements for what you need to run it.

Overall rank: #36 of 73 open models · composite 44.5/100 across 6 benchmarks in 3 categories · methodology

Knowledge

Benchmark	Score	Rank (open models)	Rank (all models)
MMLU-Pro	73.3%	#32 / 119	#91 / 259
MMLU	84.5%	#6 / 76	#13 / 136

Math

Benchmark	Score	Rank (open models)	Rank (all models)
AIME 2024/2025	9.7%	#20 / 34	#116 / 155
MATH Level 5	49.8%	#14 / 32	#64 / 108

Reasoning

Benchmark	Score	Rank (open models)	Rank (all models)
SimpleBench	23.0%	#16 / 19	#79 / 90
GPQA Diamond	50.9%	#20 / 46	#118 / 182

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.
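The rank cells in the tables above use different pool sizes per benchmark (for example 76 open models for MMLU but 19 for SimpleBench), so raw ranks are hard to compare across rows. The sketch below is one way to put them on a common scale by converting each "#rank / total" cell into the share of models outranked. The helper names `parse_rank` and `outranked_share` are hypothetical, and this is not llmrun's composite methodology, which this page does not describe.

```python
import re

# Open-model rank cells copied from the tables above.
OPEN_RANKS = {
    "MMLU-Pro": "#32 / 119",
    "MMLU": "#6 / 76",
    "AIME 2024/2025": "#20 / 34",
    "MATH Level 5": "#14 / 32",
    "SimpleBench": "#16 / 19",
    "GPQA Diamond": "#20 / 46",
}

RANK_CELL = re.compile(r"#(\d+)\s*/\s*(\d+)")


def parse_rank(cell: str) -> tuple[int, int]:
    """Split a cell such as '#32 / 119' into (rank, total). Hypothetical helper."""
    match = RANK_CELL.fullmatch(cell.strip())
    if match is None:
        raise ValueError(f"unrecognized rank cell: {cell!r}")
    return int(match.group(1)), int(match.group(2))


def outranked_share(rank: int, total: int) -> float:
    """Percentage of models in the pool that rank below this one (assumed definition)."""
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * (total - rank) / total


if __name__ == "__main__":
    for name, cell in OPEN_RANKS.items():
        rank, total = parse_rank(cell)
        print(f"{name}: outranks {outranked_share(rank, total):.1f}% of open models")
```

Under this definition a higher percentage is better, and rank 1 in a pool of N never reaches 100% because the model does not outrank itself.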