What is the best local reasoning LLM in 2026?

DeepSeek-R1 distills (the 7B/8B variants for consumer GPUs) and QwQ-32B are among the strongest reasoning models you can run locally. For maximum capability, larger MoE reasoners such as the full DeepSeek R1 (684.5B parameters, runs from 192.1 GB) exist but need multi-GPU hardware. Open any model below to confirm it fits your GPU or Mac.

Can I run DeepSeek R1 locally?

The full DeepSeek R1 is very large, but its distilled versions (DeepSeek-R1-Distill-Qwen-7B, -Llama-8B, -Qwen-32B) are designed to run on consumer hardware — the 7B/8B distills fit a single 8–12 GB GPU at Q4_K_M.
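As a rough sanity check on the sizes above, weight memory can be approximated as parameter count times bits per weight. The sketch below is illustrative only. It assumes Q4_K_M averages about 4.85 bits per weight (real GGUF files vary), uses nominal parameter counts, and adds a hypothetical fixed overhead for runtime buffers. The names `weight_gb` and `fits` are made up for this example.

```python
# Rough weight-memory estimate for quantized models (illustrative only).
# ASSUMPTION: Q4_K_M averages about 4.85 bits per weight; real files vary.
Q4_K_M_BITS_PER_WEIGHT = 4.85


def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, vram_gb: float, overhead_gb: float = 1.5) -> bool:
    """True if the weights plus a hypothetical fixed overhead fit in VRAM."""
    needed = weight_gb(params_billion, Q4_K_M_BITS_PER_WEIGHT) + overhead_gb
    return needed <= vram_gb


# Nominal sizes of the distills named above, checked against an 8 GB card.
for name, size in [("7B", 7.0), ("8B", 8.0), ("32B", 32.0)]:
    est = round(weight_gb(size, Q4_K_M_BITS_PER_WEIGHT), 2)
    print(name, est, "GB, fits in 8 GB:", fits(size, 8.0))
```

Under these assumptions the 7B and 8B distills land around 4 to 5 GB of weights, while the 32B distill needs roughly 19 GB and calls for a larger card.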

Do reasoning models need more VRAM than regular models?

Not for the weights — VRAM for weights depends on size and quantization like any model. But reasoning models produce long outputs, so allow extra VRAM for the KV cache at long context, and prefer hardware with high memory bandwidth for faster generation.
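To make the KV cache and bandwidth points concrete, here is a minimal sketch of the usual back-of-the-envelope formulas. The model shape (32 layers, 8 KV heads, head dimension 128) and the 400 GB/s bandwidth figure are assumptions chosen for illustration, not values from this page, and the function names are hypothetical. The speed estimate is only an upper bound for a dense model, since it ignores KV cache reads and compute.

```python
def kv_cache_bytes(n_layers: int, n_kv_heads: int, head_dim: int,
                   context_tokens: int, bytes_per_value: int = 2) -> int:
    """KV cache size: keys and values (the factor 2) for every layer and token."""
    return 2 * n_layers * n_kv_heads * head_dim * context_tokens * bytes_per_value


def max_tokens_per_second(bandwidth_gb_s: float, weights_gb: float) -> float:
    """Upper bound on decode speed if every weight is read once per token."""
    return bandwidth_gb_s / weights_gb


# ASSUMED model shape (typical of an 8B model with grouped-query attention).
cache = kv_cache_bytes(n_layers=32, n_kv_heads=8, head_dim=128,
                       context_tokens=32768)
print(cache / 2**30, "GiB of 16-bit KV cache at 32,768 tokens")  # 4.0

# ASSUMED hardware: 400 GB/s memory bandwidth, about 5 GB of quantized weights.
print(max_tokens_per_second(400.0, 5.0), "tokens/s upper bound")  # 80.0
```

With this assumed shape, a long reasoning trace that fills a 32K context adds about 4 GiB on top of the weights, which is why headroom beyond the weight size matters.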

Best Local Reasoning LLMs in 2026

Reasoning models think step-by-step before answering, excelling at math, logic, and multi-step problems. Thanks to distillation, you no longer need a data center to run them: compact distills of DeepSeek R1 and models like QwQ-32B bring chain-of-thought reasoning to a single consumer GPU. Below are the open-weight reasoning models you can run locally, ranked by popularity — pick one to see the exact hardware that runs it.

109 Reasoning Models You Can Run Locally

DeepSeek R1

DeepSeek · 684.5B · runs from 192.1 GB

8.6M 13.5K

DeepSeek R1 is a reasoning model trained primarily with large-scale reinforcement learning to develop chain-of-thought capabilities, using only a small amount of supervised fine-tuning as a cold start. With 684.5 billion total parameters in a mixture-of-experts architecture (only 37 billion active per token), R1 achieves performance competitive with OpenAI's o1 on math, coding, and complex reasoning benchmarks while remaining fully open-weight. Running the full R1 locally is a serious undertaking: at 8 bits per parameter the weights alone occupy roughly 685 GB, though quantized versions, starting at about 192 GB, bring it within reach of multi-GPU setups. For users who want R1-style reasoning on more modest hardware, DeepSeek also released a family of distilled models that pack R1's reasoning patterns into smaller dense architectures.
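The figures in this card can be related with simple arithmetic. The sketch below takes the 684.5B parameter count and the 192.1 GB minimum from the listing and computes weight sizes at a few precisions, plus the average bits per weight that the 192.1 GB figure would imply. It assumes 1 GB = 1e9 bytes and that the listed minimum covers weights only; both are assumptions, and the helper names are hypothetical.

```python
TOTAL_PARAMS = 684.5e9   # total parameters, from the listing
LISTED_MIN_GB = 192.1    # "runs from" figure, from the listing


def weights_gb(params: float, bits_per_weight: float) -> float:
    """Weight size in GB (1 GB = 1e9 bytes) at a given average precision."""
    return params * bits_per_weight / 8 / 1e9


def implied_bits_per_weight(size_gb: float, params: float) -> float:
    """Average bits per weight implied by a file size (weights only, assumed)."""
    return size_gb * 1e9 * 8 / params


for bits in (16, 8, 4):
    print(f"{bits}-bit: {weights_gb(TOTAL_PARAMS, bits):.1f} GB")

bpw = implied_bits_per_weight(LISTED_MIN_GB, TOTAL_PARAMS)
print(f"192.1 GB implies about {bpw:.2f} bits per weight")
```

Under these assumptions the smallest listed build corresponds to a little over 2 bits per weight, which is a far more aggressive quantization than the Q4-class formats typically used for the distills.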

DeepSeek R1

DeepSeek R1 0528 Qwen3 8B

DeepSeek R1 Distill Llama 70B

DeepSeek R1 Distill Qwen 32B

DeepSeek R1 Distill Qwen 1.5B

DeepSeek R1 0528

QwQ 32B

DeepSeek R1 Distill Llama 8B

DeepSeek R1 Distill Qwen 14B

Nemotron 3 Nano Omni 30B A3B Reasoning BF16

DeepSeek R1 Distill Qwen 7B

Qwythos 9B Claude Mythos 5 1M

Hermes 4 14B

Gemma 4 12B Agentic Fable5 Composer2.5 v2 3.5x Tau2

Nemotron Cascade 2 30B A3B

DeepSeek R1 Distill Qwen 1.5B

VibeThinker 3B

Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled

Ouro 1.4B

VulnLLM R 7B

Huihui Qwen3.6 35B A3B Claude 4.7 Opus Abliterated

GLM 5.2 W4AFP8

MN 12B Mag Mell R1

Gemma 4 12B Coder Fable5 Composer2.5 V1

QwQ 32B Preview

Mythos Nano

DeepSeek R1 Distill Qwen 32B Abliterated

Phi 4 Mini Reasoning

OneReason 0.8B Pretrain Competition

Nemotron Cascade 8B

Qwen3.6 35B A3B Claude 4.7 Opus Reasoning Distilled

Hermes 4.3 36B

Phi 4 Reasoning Plus

VulnLLM R 7B

Qwen3.6 27B AEON Ultimate Uncensored BF16

Qwen27b Abliterated Fable MTP

Phi 4 Reasoning

Qwen3.6 27B Uncensored HauhauCS Aggressive MTP

Qwopus3.6 27B Coder

Qwopus3.6 27B v2

Qwythos 9B v2

Trinity Large Thinking

Ouro 2.6B Thinking

DeepSeek R1 Zero

Qwen Marketing

Qwen2.5 Coder 3B Claude Opus 4.6 Distilled

Tri 21B Think

OFFELLIA Gemma 4 E4B 8B Claude 4.6 Opus Reasoning MTP

Qwen3.5 9B Claude 4.6 Opus Reasoning Distilled

Qwen3.6 35B A3B Claude 4.6 Opus Reasoning Distilled

Qwopus3.5 9B v3

Hermes 4 70B

Qwen3 4B Gemini 3.1 Pro Reasoning Distilled

Ornith 1.0 35B AEON Ultimate Uncensored BF16

AI21 Jamba Reasoning 3B

Qwen35B Agent R2

INTELLECT 3

Qwen3.5 2B Claude 4.6 Opus Reasoning Distilled

Qwen3.5 4B Safety Thinking

Qwen3.5 4B Claude 4.6 Opus Reasoning Distilled

Claude OSS

Nemotron Research Reasoning Qwen 1.5B

Qwen3.5 35B A3B Claude 4.6 Opus Reasoning Distilled

Nemotron Labs Audex 2B

Gemma 4 12B IT AEON Abliterated K4 BF16

Nemotron Content Safety Reasoning 4B

Nemotron Labs Audex 30B A3B

Carnice V1 9B Hermes Agent Stage2 Merged

Qwen2.5 Coder 7B Bird Cot

Qwen3 42B A3B 2507 Thinking Abliterated Uncensored TOTAL RECALL v2 Medium MASTER CODER

Domyn Small v1.0

Turkish Gemma 9B T1

Parable Qwen3 4B Claude Fable 5

Qwopus3.5 4B Coder

ExtGemma4 40 5B

Qwen3.5 4B Claude Opus 4.6 Distilled Heretic

Grug 12B

Soren 1 Small

Grug 35B A3B