Question 1

What is the best open LLM on SWE-bench Lite?

Accepted Answer

Qwen3 Coder 30B A3B Instruct is the top open model on SWE-bench Lite, scoring 49.7%. Among all entries tested, including proprietary ones, it ranks #8. The top entry overall is ExpeRepair-v1.0 + Claude 4 Sonnet at 60.3%.

Question 2

What's the best SWE-bench Lite model you can run on a 24 GB GPU?

Accepted Answer

Qwen3 Coder 30B A3B Instruct is the highest-scoring open model that fits in 24 GB at 4-bit quantization (about 17 GB), scoring 49.7% on SWE-bench Lite.
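
As a rough sanity check on the figure above, here is a minimal sketch of the arithmetic. The function names are hypothetical, and the formula only counts raw weight storage (parameters times bits per weight); the 30.5B parameter count and the roughly 17 GB footprint are the numbers quoted on this page, not measurements.

```python
def weight_footprint_gb(params_billion: float, bits_per_weight: float) -> float:
    """Raw weight storage in GB (1 GB = 10**9 bytes), ignoring all overhead."""
    return params_billion * bits_per_weight / 8


def fits_in_vram(footprint_gb: float, vram_gb: float, headroom_gb: float = 0.0) -> bool:
    """True if the footprint plus the requested headroom fits in VRAM."""
    return footprint_gb + headroom_gb <= vram_gb


# 30.5B parameters at 4 bits each (parameter count taken from the table on this page)
raw = weight_footprint_gb(30.5, 4)
print(f"raw 4-bit weights: {raw:.2f} GB")

# 17 GB is the quantized footprint quoted in the answer above
print("quoted 17 GB footprint fits in 24 GB:", fits_in_vram(17.0, 24.0))

# For comparison: unquantized 16-bit weights
print("16-bit weights fit in 24 GB:", fits_in_vram(weight_footprint_gb(30.5, 16), 24.0))
```

The raw 4-bit weights come out a little under the quoted 17 GB; the difference is presumably quantization metadata and layers kept at higher precision, though that is an assumption. Whatever remains of the 24 GB has to cover the KV cache and activations, so the usable context length depends on your runtime and settings.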

Question 3

Can open models match proprietary models on SWE-bench Lite?

Accepted Answer

Not quite. On SWE-bench Lite, the strongest proprietary entry (ExpeRepair-v1.0 + Claude 4 Sonnet) scores 60.3%, 10.6 points ahead of the best open model (Qwen3 Coder 30B A3B Instruct) at 49.7%. The open model, however, is one you can run yourself.

Rank (open / overall)	Model · Parameters	Score
1 / 8	Qwen3 Coder 30B A3B Instruct · 30.5B	49.7%
2 / 34	DeepSeek V3 · 684.5B	36.7%
3 / 46	DeepSeek V3.2 · 685.4B	30.7%
