StableBeluga2 — Benchmarks
Benchmark scores for StableBeluga2, aggregated from public leaderboards, along with its rank among open models. See hardware requirements for what you need to run it.
Overall rank: #4 of 75 open models · composite 69.1/100 across 4 benchmarks in 3 categories · methodology
Knowledge
Math
| Benchmark | Score | Open rank | All models |
|---|---|---|---|
| GSM8K | 69.6% | #18 / 59 | #27 / 93 |
Reasoning
| Benchmark | Score | Open rank | All models |
|---|---|---|---|
| BIG-Bench Hard | 69.3% | #8 / 37 | #12 / 50 |
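Rank cells such as `#18 / 59` are easier to compare across benchmarks with different field sizes when expressed as a "top X%" figure. The sketch below is only an illustration: the helper name `rank_to_top_percent` is hypothetical and is not part of llmrun or its methodology. It assumes each cell has the form `#position / total`, as in the tables above.

```python
def rank_to_top_percent(rank_cell: str) -> float:
    """Convert a rank cell like '#18 / 59' into a 'top X%' figure.

    Hypothetical helper for illustration; assumes the '#position / total'
    format used in the tables on this page.
    """
    position, total = rank_cell.lstrip("#").split("/")
    # int() tolerates the surrounding spaces left over from the split.
    return round(100 * int(position) / int(total), 1)


# Rank cells copied from the tables on this page.
ranks = {
    "GSM8K (open)": "#18 / 59",
    "GSM8K (all)": "#27 / 93",
    "BIG-Bench Hard (open)": "#8 / 37",
    "BIG-Bench Hard (all)": "#12 / 50",
}

for name, cell in ranks.items():
    print(f"{name}: top {rank_to_top_percent(cell)}%")
```

Under this reading, the BIG-Bench Hard open rank (#8 of 37) places the model in roughly the top 22% of open models, while the GSM8K open rank (#18 of 59) is closer to the top 31%.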
Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.