Kimi K2.6 — Benchmarks

Benchmark scores for Kimi K2.6 aggregated from public leaderboards, with how it ranks among open models. See hardware requirements for what you need to run it.

Coding

BenchmarkScoreRank
SWE-bench Verified76.6%#1 / 4
LiveBench Coding78.6#1 / 23

Knowledge

BenchmarkScoreRank
SimpleQA38.7%#2 / 7

Math

BenchmarkScoreRank
AIME 2024/202596.1%#1 / 29
FrontierMath39.0%#1 / 12
LiveBench Math84.3#5 / 23

Reasoning

BenchmarkScoreRank
GPQA Diamond90.8%#1 / 41
LiveBench Reasoning79.4#2 / 23

Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.