Llama 2 70B HF — Benchmarks

Benchmark scores for Llama 2 70B HF, aggregated from public leaderboards, with its rank among open models and among all models. See hardware requirements for what you need to run it.

Overall rank: #11 of 73 open models. Composite score: 63.8/100 across 5 benchmarks in 3 categories (see methodology).

Knowledge

Benchmark	Score	Rank (open models)	Rank (all models)
HellaSwag	85.3%	#6 / 42	#10 / 76
MMLU-Pro	37.5%	#87 / 119	#208 / 259
MMLU	69.9%	#26 / 76	#62 / 136

Math

Benchmark	Score	Rank (open models)	Rank (all models)
GSM8K	69.6%	#17 / 59	#26 / 93

Reasoning

Benchmark	Score	Rank (open models)	Rank (all models)
BIG-Bench Hard	64.9%	#9 / 37	#13 / 50

Scores are aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks itself.
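
The ranks in the tables are out of different field sizes (42 open models for HellaSwag, 119 for MMLU-Pro), so they are hard to compare directly. One rough way to put them on a common scale is to divide each rank by its field size. The sketch below does that for the open-model ranks listed on this page. The helper name `top_fraction` is hypothetical, and this is not the page's composite methodology, which is not described here.

```python
# Rows copied from the tables on this page:
# (benchmark, score in %, open-model rank, number of open models ranked)
ROWS = [
    ("HellaSwag", 85.3, 6, 42),
    ("MMLU-Pro", 37.5, 87, 119),
    ("MMLU", 69.9, 26, 76),
    ("GSM8K", 69.6, 17, 59),
    ("BIG-Bench Hard", 64.9, 9, 37),
]


def top_fraction(rank: int, total: int) -> float:
    """Return rank / total, the share of the field at or above this rank.

    Hypothetical helper for illustration; smaller values mean a better placing.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return rank / total


for name, score, rank, total in ROWS:
    share = top_fraction(rank, total)
    print(f"{name:15s} {score:5.1f}%  rank {rank}/{total}  top {share:.0%} of open models")
```

Read this way, the MMLU-Pro placing stands out as much weaker than the others, which the raw rank numbers alone make easy to miss.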