StableBeluga2 — Benchmarks

Benchmark scores for StableBeluga2 aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.

Overall rank: #7 of 73 open modelscomposite 69.1/100 across 4 benchmarks in 3 categories · methodology

Knowledge

Benchmark	Score	Open rank	All models
HellaSwag	84.1%	#9 / 42	#14 / 76
MMLU	68.6%	#30 / 76	#71 / 136

Math

Benchmark	Score	Open rank	All models
GSM8K	69.6%	#18 / 59	#27 / 93

Reasoning

Benchmark	Score	Open rank	All models
BIG-Bench Hard	69.3%	#8 / 37	#12 / 50

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.