Kimi K2 Models — Hardware Requirements

This page lists 7 Kimi K2 models from Moonshot AI and the community, ranging from 1.8B to 1058.6B parameters. The smallest runs in as little as 1.1 GB of VRAM. Every row links to full quantization tables and GPU compatibility.

All Kimi K2 Models by Size

Model | Params | Context
Kimi K2.6 Eagle3 | 1.8B | 262K
Kimi K2 Instruct | 1026.5B | 131K
Kimi K2 Instruct 0905 | 1026.5B | 262K
Kimi K2 Thinking | 1058.1B | 262K
Kimi K2.6 | 1058.6B | 262K
Kimi K2.5 | 1058.6B | 262K
Kimi K2.7 Code | 1058.6B | 262K

How Kimi K2 Compares — Benchmark Rating

Kimi K2.6 is the highest-rated Kimi K2 model, with an overall benchmark rating of 68.2/100, which ranks it #6 among 75 open models. The top proprietary model, GPT 5.5, scores 88.8.

GPT 5.5 (proprietary): 88.8
Claude Opus 4.7 (proprietary): 87.6
Claude Fable 5 (proprietary): 86.6
GPT 5.4 (proprietary): 86.6
Claude Opus 4.8 (proprietary): 84.4
Ratings are a composite of normalized public benchmark scores.

Frequently Asked Questions

How much VRAM do I need to run a Kimi K2 model?
The smallest Kimi K2 model, Kimi K2.6 Eagle3, needs as little as 1.1 GB of VRAM at an aggressive quantization. Larger family members need more, roughly in proportion to their parameter count; the table above lists every model.
Which Kimi K2 models can I run on a 16 GB GPU?
Only 1 of the 7 Kimi K2 models fits in 16 GB of VRAM at some quantization: Kimi K2.6 Eagle3.
What is the most popular Kimi K2 model to run locally?
Kimi K2.6 is the most downloaded Kimi K2 model in local-friendly quantized formats. It needs at least 295.0 GB of VRAM.
How do Kimi K2 models score on benchmarks?
Kimi K2.6 leads the family with an overall benchmark rating of 68.2/100, ranking #6 among 75 open models, while the top proprietary model, GPT 5.5, scores 88.8. See the comparison chart above for the full standings.
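
For the VRAM questions above, a rough weight-only estimate can be sketched from parameter count and bits per weight. This is a minimal sketch under stated assumptions, not the method this page uses for its figures: the function names (weight_memory_gb, fits_in_vram) and the bit widths are hypothetical choices for illustration, and the estimate ignores KV cache, activations, and runtime overhead, so real requirements will be higher and the numbers will not necessarily match the quantization tables.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in decimal GB.

    Assumption: every parameter is stored at the same average bit width.
    KV cache, activations, and framework overhead are NOT included.
    """
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


def fits_in_vram(params_billions: float, bits_per_weight: float,
                 vram_gb: float, overhead_gb: float = 0.0) -> bool:
    """Return True if the weight estimate plus a caller-chosen overhead fits."""
    needed = weight_memory_gb(params_billions, bits_per_weight) + overhead_gb
    return needed <= vram_gb


# Parameter counts (in billions) taken from the table on this page.
MODELS = {
    "Kimi K2.6 Eagle3": 1.8,
    "Kimi K2 Instruct": 1026.5,
    "Kimi K2.6": 1058.6,
}

if __name__ == "__main__":
    # Illustrative bit widths only; real quantization formats vary.
    for name, params in MODELS.items():
        for bits in (16, 8, 4):
            gb = weight_memory_gb(params, bits)
            verdict = "fits" if fits_in_vram(params, bits, vram_gb=16) else "does not fit"
            print(f"{name} at {bits}-bit: ~{gb:.1f} GB weights, {verdict} in 16 GB")
```

Even this crude estimate shows why only the 1.8B model is a candidate for a 16 GB GPU: at 4 bits per weight, the models above 1000B parameters need hundreds of GB for weights alone.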