Phi 2 — Benchmarks
Benchmark scores for Phi 2 aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.
Knowledge
Reasoning
| Benchmark | Score | Rank |
|---|---|---|
| BIG-Bench Hard | 59.4% | #3 / 11 |
Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.