Llama 3.1 8B Instruct — Benchmarks

Benchmark scores for Llama 3.1 8B Instruct aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.

Knowledge

BenchmarkScoreRank
MMLU56.1%#25 / 36

Math

BenchmarkScoreRank
AIME 2024/20252.5%#18 / 22
MATH Level 522.9%#16 / 23
GSM8K82.4%#9 / 27

Reasoning

BenchmarkScoreRank
GPQA Diamond25.9%#26 / 28

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.