DeepSeek V3.1 — Benchmarks

Benchmark scores for DeepSeek V3.1 aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.

Knowledge

Benchmark	Score	Open rank	All models
MMLU-Pro	84.8%	#7 / 119	#35 / 259

Reasoning

Benchmark	Score	Open rank	All models
SimpleBench	40.0%	#10 / 19	#61 / 90

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.