Llama 3.1 405B — Benchmarks

Benchmark scores for Llama 3.1 405B, aggregated from public leaderboards, together with its rank among open models on each benchmark. See hardware requirements for what you need to run it.

Knowledge

Benchmark        Score    Rank
MMLU             84.4%    #3 / 36
HellaSwag        89.2%    #1 / 22

Reasoning

Benchmark        Score    Rank
BIG-Bench Hard   82.9%    #1 / 11

Scores are aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks itself.