Question 1

What is the best open LLM on SWE-bench Multilingual?

Accepted Answer

GLM 5 is the top open model on SWE-bench Multilingual, scoring 69.7%. Among all models tested, including proprietary ones, it ranks #4. The top model overall is Gemini 3 Flash (Google) at 72.7%.

Question 2

Can open models match proprietary models on SWE-bench Multilingual?

Accepted Answer

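Not quite. On SWE-bench Multilingual, the strongest proprietary model (Gemini 3 Flash) scores 72.7%, which is 3.0 percentage points ahead of the best open model (GLM 5) at 69.7%. The open model, however, is one you can run yourself. The table below lists the open models; each rank is given as open rank / overall rank.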

Rank (open / overall)	Model · Size	Score
1 / 4	GLM 5 · 753.9B	69.7%
2 / 6	MiniMax M2.5 · 228.7B	68.3%
3 / 7	Kimi K2.5 · 1058.6B	67.3%
4 / 13	DeepSeek V3.2 · 685.4B	59.0%
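
The gap between each open model and the top overall score can be worked out directly from the figures quoted above. The following is a minimal sketch: the scores are copied from the table and the 72.7% top score from the answer, while the names `OPEN_MODELS`, `TOP_OVERALL_SCORE`, and `gap_to_top` are hypothetical helpers introduced only for illustration.

```python
# Rows copied from the table above: (open rank, overall rank, model, score in %).
OPEN_MODELS = [
    (1, 4, "GLM 5", 69.7),
    (2, 6, "MiniMax M2.5", 68.3),
    (3, 7, "Kimi K2.5", 67.3),
    (4, 13, "DeepSeek V3.2", 59.0),
]

# Top overall score (Gemini 3 Flash), as quoted in the answer above.
TOP_OVERALL_SCORE = 72.7


def gap_to_top(score, top=TOP_OVERALL_SCORE):
    """Return the percentage-point gap between a score and the top score."""
    # Round to one decimal place to match the precision of the leaderboard.
    return round(top - score, 1)


# Map each open model to its gap behind the top overall model.
gaps = {name: gap_to_top(score) for _, _, name, score in OPEN_MODELS}

for name, gap in gaps.items():
    print(f"{name}: {gap} percentage points behind the top score")
```

Gaps are expressed in percentage points rather than as relative percentages, since the leaderboard scores are themselves percentages.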
