Llama 3.1 8B Instruct — Benchmarks
Benchmark scores for Llama 3.1 8B Instruct aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.
Knowledge
| Benchmark | Score | Rank |
|---|---|---|
| MMLU | 56.1% | #25 / 36 |
Math
| Benchmark | Score | Rank |
|---|---|---|
| AIME 2024/2025 | 2.5% | #18 / 22 |
| MATH Level 5 | 22.9% | #16 / 23 |
| GSM8K | 82.4% | #9 / 27 |
Reasoning
| Benchmark | Score | Rank |
|---|---|---|
| GPQA Diamond | 25.9% | #26 / 28 |
Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.