DeepSeek V3.1 — Benchmarks

Benchmark scores for DeepSeek V3.1 aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.

Reasoning

BenchmarkScoreRank
SimpleBench40.0%#7 / 15

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.