Llama 2 70B Chat HF — Benchmarks
Benchmark scores for Llama 2 70B Chat HF, aggregated from public leaderboards, together with its rank among open models on each benchmark. See hardware requirements for what you need to run it.
Math
| Benchmark | Score | Rank |
|---|---|---|
| AIME 2024/2025 | 0.0% | #22 / 22 |
| MATH Level 5 | 3.3% | #23 / 23 |
Reasoning
| Benchmark | Score | Rank |
|---|---|---|
| GPQA Diamond | 26.3% | #24 / 28 |
Scores are aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks itself.
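To make the Rank column easier to read, here is a minimal Python sketch that parses the `#k / n` strings from the tables above. It assumes `#k / n` means position k out of n ranked open models, with #1 as the best score; the page implies this but does not state it. The names `parse_rank` and `models_ranked_below` are hypothetical helpers written for this illustration, not part of any llmrun tooling.

```python
import re

# Rows copied from the tables above: (benchmark, score, rank string).
ROWS = [
    ("AIME 2024/2025", "0.0%", "#22 / 22"),
    ("MATH Level 5", "3.3%", "#23 / 23"),
    ("GPQA Diamond", "26.3%", "#24 / 28"),
]

# Matches strings such as "#24 / 28" (assumed format: "#position / total").
RANK_RE = re.compile(r"#(\d+)\s*/\s*(\d+)")


def parse_rank(rank: str) -> tuple[int, int]:
    """Split a '#k / n' rank string into (position, total)."""
    match = RANK_RE.fullmatch(rank.strip())
    if match is None:
        raise ValueError(f"unrecognized rank format: {rank!r}")
    return int(match.group(1)), int(match.group(2))


def models_ranked_below(rank: str) -> int:
    """Number of ranked models placed below this one (assumes #1 is best)."""
    position, total = parse_rank(rank)
    return total - position


for name, score, rank in ROWS:
    position, total = parse_rank(rank)
    below = models_ranked_below(rank)
    print(f"{name}: {score}, position {position} of {total}, {below} ranked below")
```

Under that assumption, the model sits in last place on both math benchmarks and ahead of four ranked models on GPQA Diamond.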