Llama 2 70B HF — Benchmarks

Benchmark scores for Llama 2 70B HF aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.

Knowledge

BenchmarkScoreRank
MMLU69.9%#13 / 36
HellaSwag85.3%#3 / 22

Math

BenchmarkScoreRank
GSM8K63.3%#13 / 27

Reasoning

BenchmarkScoreRank
BIG-Bench Hard51.2%#6 / 11

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.