How much VRAM do I need to run a Qwen 3.6 model?

The smallest Qwen 3.6 model, Qwen3.6 27B MTPLX Optimized Speed, runs from 2.7 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.

Which Qwen 3.6 models can I run on a 16 GB GPU?

22 of 25 Qwen 3.6 models fit in 16 GB of VRAM at some quantization, including Qwen3.6 35B A3B, Qwen3.6 27B, Qwen3.6 27B Uncensored Heretic v2 Native MTP Preserved.

What is the most popular Qwen 3.6 model to run locally?

Qwen3.6 35B A3B is the most downloaded Qwen 3.6 model in local-friendly quantized formats. It runs from 10.3 GB of VRAM.

How do Qwen 3.6 models score on benchmarks?

Qwen3.6 27B leads the family with an overall benchmark rating of 74.0/100, ranking #4 among 73 open models, while the top proprietary model, Claude Fable 5 Max, scores 89.9. See the comparison chart above for the full standings.

Qwen 3.6 Models — Hardware Requirements

14 Qwen 3.6 models from Alibaba and the community, from the smallest that runs in 2.7 GB of VRAM up to 36.0B parameters. Every row links to full quantization tables and GPU compatibility.

All Qwen 3.6 Models by Size

Model	Params	Runs from	Context	Publisher	Quant downloads
Qwen3.6 27B MTPLX Optimized Speed	4.7B	2.7 GB	262K	Youssofal	—
Qwen3.6 14B A3B FableVibes	13.8B	6.2 GB	262K	tvall43	72.7K
Qwen3.6 27B MTPLX Optimized	26.9B	12.2 GB	262K	Youssofal	—
Qwen3.6 27B OBLITERATED	26.9B	12.2 GB	262K	OBLITERATUS	—
Qwen3.6 27B DFlash	27B	11.8 GB	262K	z-lab	26.3K
Qwen3.6 27B PRISM PRO DQ	27B	12.6 GB	—	Ex0bit	—
Qwen3.6 27B Uncensored HauhauCS Aggressive MTP	27B	10.0 GB	—	AIOpsInSpace	—
Qwen3.6 27B MTP TQ3 4S	27B	12.6 GB	—	YTan2000	—
Qwen3.6 27B Uncensored Heretic v2 Native MTP Preserved	27.4B	12.4 GB	262K	llmfan46	85.7K
Qwen3.6 27B AEON Ultimate Uncensored BF16	27.4B	12.4 GB	262K	AEON-7	66.4K
ThinkingCap Qwen3.6 27B	27.4B	12.4 GB	262K	bottlecapai	—
Qwen3.6 27B Heretic2 Uncensored Finetune Thinking	27.4B	12.4 GB	262K	DavidAU	—
Qwen3.6 27B	27.8B	8.4 GB	262K	Alibaba	4.7M
Huihui Qwen3.6 27B Abliterated	27.8B	12.6 GB	262K	huihui-ai	—
Qwen3.6 27B Uncensored HauhauCS Aggressive Safetensor Benchmark	27.8B	12.6 GB	262K	DreamFast	—
Qwen3.6 28B	28.2B	12.4 GB	262K	0xSero	—
Qwen3.6 35B A3B DFlash	35B	15.2 GB	262K	z-lab	5.9K
Qwen3.6 35B A3b Crown Halo Mtp Dynamic	35B	16.4 GB	—	jcbtc	—
Qwen3.6 35B A3B Uncensored Heretic Native MTP Preserved	35.1B	15.3 GB	262K	llmfan46	57.4K
Qwen3.6 35B A3B	36.0B	10.3 GB	262K	Alibaba	9.8M
Qwen3.6 35B A3B Claude 4.6 Opus Reasoning Distilled	36.0B	15.7 GB	262K	hesamation	63.9K
Huihui Qwen3.6 35B A3B Claude 4.7 Opus Abliterated	36.0B	15.7 GB	262K	huihui-ai	5.1K
Qwen3.6 35B A3B Claude 4.7 Opus Reasoning Distilled	36.0B	15.7 GB	262K	lordx64	—
Qwen3.6 40B Claude 4.6 Opus Deckard Heretic Uncensored Thinking	39.5B	80.0 GB	262K	DavidAU	10.2K
Qwen3.6 40B Deckard MTP	40B	18.7 GB	—	PiehSoft	—

How Qwen 3.6 Compares — Benchmark Rating

Qwen3.6 27B is the highest-rated Qwen 3.6 model with an overall benchmark rating of 74.0/100 — #4 among 73 open models. The top proprietary model, Claude Fable 5 Max, scores 89.9. Click a model to see its full benchmark breakdown.

Claude Fable 5 Max · proprietary89.9

GPT 5.5 · proprietary89.2

GPT 5.6 Sol · proprietary89.2

Claude Fable 5 · proprietary88.6

Claude Opus 4.8 · proprietary88.1

GLM 5.282.7

Inkling79.2

DeepSeek V4 Pro74.3

Qwen3.6 27B74.0

DeepSeek V4 Flash73.2

Kimi K2.7 Code69.2

Composite of normalized public benchmark scores (methodology) · ■ Qwen 3.6 · ■ other models

Frequently Asked Questions

How much VRAM do I need to run a Qwen 3.6 model?: The smallest Qwen 3.6 model, Qwen3.6 27B MTPLX Optimized Speed, runs from 2.7 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.
Which Qwen 3.6 models can I run on a 16 GB GPU?: 22 of 25 Qwen 3.6 models fit in 16 GB of VRAM at some quantization, including Qwen3.6 35B A3B, Qwen3.6 27B, Qwen3.6 27B Uncensored Heretic v2 Native MTP Preserved.
What is the most popular Qwen 3.6 model to run locally?: Qwen3.6 35B A3B is the most downloaded Qwen 3.6 model in local-friendly quantized formats. It runs from 10.3 GB of VRAM.
How do Qwen 3.6 models score on benchmarks?: Qwen3.6 27B leads the family with an overall benchmark rating of 74.0/100, ranking #4 among 73 open models, while the top proprietary model, Claude Fable 5 Max, scores 89.9. See the comparison chart above for the full standings.