Llama 2 70B Chat HF — Benchmarks

Benchmark scores for Llama 2 70B Chat HF, aggregated from public leaderboards, along with its rank among open models. See hardware requirements for what you need to run it.

Math

Benchmark        Score   Rank
AIME 2024/2025   0.0%    #22 / 22
MATH Level 5     3.3%    #23 / 23

Reasoning

Benchmark      Score   Rank
GPQA Diamond   26.3%   #24 / 28

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.