Phi 2 — Benchmarks

Benchmark scores for Phi 2 aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.

Knowledge

BenchmarkScoreRank
MMLU56.3%#24 / 36
HellaSwag53.6%#18 / 22

Reasoning

BenchmarkScoreRank
BIG-Bench Hard59.4%#3 / 11

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.