GLM 5.1 — Benchmarks
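Benchmark scores for GLM 5.1, aggregated from public leaderboards, along with its rank among open models on each benchmark. Rank is shown as position / number of open models compared, so "#1 / 2" means first of two. See hardware requirements for what you need to run it.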
Coding
| Benchmark | Score | Rank |
|---|---|---|
| SWE-bench Verified | 74.2% | #1 / 2 |
Knowledge
| Benchmark | Score | Rank |
|---|---|---|
| SimpleQA | 37.3% | #2 / 4 |
Math
| Benchmark | Score | Rank |
|---|---|---|
| AIME 2024/2025 | 92.2% | #1 / 22 |
| FrontierMath | 33.5% | #1 / 3 |
Reasoning
| Benchmark | Score | Rank |
|---|---|---|
| GPQA Diamond | 85.5% | #2 / 28 |
| SimpleBench | 58.7% | #1 / 10 |
Scores are aggregated from public benchmark sources, each linked from the benchmark pages. llmrun does not run these benchmarks itself.