Llama 2 13B Chat HF — Benchmarks

Benchmark scores for Llama 2 13B Chat HF, aggregated from public leaderboards, along with its rank among open models on each benchmark. See hardware requirements for what you need to run it.

Knowledge

Benchmark    Score    Rank
MMLU         50.9%    #56 / 74

Math

Benchmark    Score    Rank
GSM8K        36.9%    #36 / 58

Reasoning

Benchmark         Score    Rank
BIG-Bench Hard    58.2%    #11 / 36
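
The three ranks above are measured against pools of different sizes (74, 58, and 36 models), so the raw positions are not directly comparable. As a rough sketch, each rank can be converted into the share of ranked models the model places ahead of. This assumes rank #1 is the best score and that there are no ties, neither of which the page states; the function name `fraction_outranked` is made up for illustration.

```python
# Rank data copied from the tables on this page: (rank, number of models ranked).
RANKS = {
    "MMLU": (56, 74),
    "GSM8K": (36, 58),
    "BIG-Bench Hard": (11, 36),
}


def fraction_outranked(rank: int, total: int) -> float:
    """Share of the ranked models that place below the given rank (0.0 to 1.0).

    Assumption: rank 1 is the best score and ranks contain no ties.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return (total - rank) / total


for name, (rank, total) in RANKS.items():
    share = fraction_outranked(rank, total)
    print(f"{name}: #{rank} / {total} -> ahead of {share:.0%} of ranked models")
```

Under these assumptions the model places ahead of roughly 24% of ranked models on MMLU, 38% on GSM8K, and 69% on BIG-Bench Hard, which makes its comparatively stronger standing on the reasoning benchmark easier to see than the raw positions do.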

Scores are aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.