Qwen 3.5 Models — Hardware Requirements

25 Qwen 3.5 models from Alibaba and the community, from the smallest that runs in 0.7 GB of VRAM up to 125.1B parameters. Every row links to full quantization tables and GPU compatibility.

All Qwen 3.5 Models by Size

ModelParamsContext
Qwen3.5 4B DFlash537M262K
Josiefied Qwen3.5 0.8B Gabliterated V1853M262K
Qwen3.5 0.8B873M262K
Qwen3.5 9B DFlash1.0B262K
Qwen3.5 2B Text Only1.9B262K
Qwen3.5 2B Claude 4.6 Opus Reasoning Distilled2.3B262K
Qwen3.5 4B PTBR4B
Qwen3.5 4B Safety Thinking4.2B262K
Qwen3.5 4B Claude Opus 4.6 Distilled Heretic4.5B262K
Qwen3.5 4B4.7B262K
Qwen3.5 4B Claude 4.6 Opus Reasoning Distilled4.7B262K
Qwen3.5 4B MiniFantasy MTP4.7B262K
Qwen3.5 9B Abliterated9.0B262K
Qwen3.5 9B Uncensored9B
Qwen3.5 9B Humanize DPO Round29B
GrepSeek Qwen3.5 9B GRPO9.4B262K
Qwen3.5 9B Claude 4.6 Opus Reasoning Distilled9.7B262K
Qwen3.5 9B9.7B262K
Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill9.7B262K
Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled Heretic v227.4B262K
Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled27.8B262K
Qwen3.5 35B A3B DFlash35B262K
PrunedHub Qwen3.5 35B A3B 80pct35B
Qwen3.5 35B A3B Claude 4.6 Opus Reasoning Distilled36.0B262K
Qwen3.5 122B A10B125.1B262K

Frequently Asked Questions

How much VRAM do I need to run a Qwen 3.5 model?
The smallest Qwen 3.5 model, Qwen3.5 0.8B, runs from 0.7 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.
Which Qwen 3.5 models can I run on a 16 GB GPU?
19 of 25 Qwen 3.5 models fit in 16 GB of VRAM at some quantization, including Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled, Qwen3.5 9B Claude 4.6 Opus Reasoning Distilled, Qwen3.5 4B.
What is the most popular Qwen 3.5 model to run locally?
Qwen3.5 122B A10B is the most downloaded Qwen 3.5 model in local-friendly quantized formats. It runs from 53.5 GB of VRAM.