Question 1

What is the best open LLM on Humanity's Last Exam?

Accepted Answer

Kimi K2.5 is the top-scoring open model on Humanity's Last Exam, at 24.4%. Among all models tested, proprietary ones included, it ranks #13. The top model overall is Gemini 3.1 Pro Preview (Google DeepMind) at 46.4%.

Question 2

Can open models match proprietary models on Humanity's Last Exam?

Accepted Answer

Not yet. The strongest proprietary model, Gemini 3.1 Pro Preview, scores 46.4%, which is 22.0 percentage points ahead of the best open model, Kimi K2.5, at 24.4%. The open model's advantage is that you can run it yourself.

Rank (open / overall)	Model · Parameters	Score
1 / 13	Kimi K2.5 · 1058.6B	24.4%
2 / 30	GLM 4.5 · 358.3B	8.3%
3 / 31	GLM 4.5 Air · 110.5B	8.1%
4 / 38	Llama 4 Maverick 17B 128E Instruct · 401.6B	5.7%
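
The gap between each open model and the top overall score can be worked out directly from the figures quoted on this page. The following is a minimal sketch, not part of the leaderboard itself: the rows are copied by hand from the table, and the names `OPEN_MODELS`, `TOP_OVERALL`, and `gap_to_top` are hypothetical helpers introduced only for illustration.

```python
# Rows copied by hand from the table: (open rank, overall rank, model, score in %).
OPEN_MODELS = [
    (1, 13, "Kimi K2.5", 24.4),
    (2, 30, "GLM 4.5", 8.3),
    (3, 31, "GLM 4.5 Air", 8.1),
    (4, 38, "Llama 4 Maverick 17B 128E Instruct", 5.7),
]

# Top model overall, as quoted in the answers above.
TOP_OVERALL = ("Gemini 3.1 Pro Preview", 46.4)


def gap_to_top(score, top_score=TOP_OVERALL[1]):
    """Return the gap to the top score in percentage points, rounded to one decimal."""
    return round(top_score - score, 1)


# Best open model is the row with the highest score.
best_open = max(OPEN_MODELS, key=lambda row: row[3])

# Gap to the top overall score for every open model in the table.
gaps = {name: gap_to_top(score) for _, _, name, score in OPEN_MODELS}

for name, gap in gaps.items():
    print(f"{name}: {gap} points behind {TOP_OVERALL[0]}")
```

Rounding to one decimal keeps the result at the same precision as the published scores and avoids floating-point noise in the subtraction.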

Humanity's Last Exam Leaderboard

Open models ranked on Humanity's Last Exam

Humanity's Last Exam: frequently asked questions