Qwen 3.5 Models — Hardware Requirements
25 Qwen 3.5 models from Alibaba and the community, ranging from the smallest, which runs in as little as 0.7 GB of VRAM, to the largest at 125.1B parameters. Every row links to full quantization tables and GPU compatibility.
All Qwen 3.5 Models by Size
Frequently Asked Questions
- How much VRAM do I need to run a Qwen 3.5 model?
- The smallest Qwen 3.5 model, Qwen3.5 0.8B, needs as little as 0.7 GB of VRAM at an aggressive quantization. Larger family members need proportionally more; see the table above for every model.
- Which Qwen 3.5 models can I run on a 16 GB GPU?
- 19 of the 25 Qwen 3.5 models fit in 16 GB of VRAM at some quantization, including Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled, Qwen3.5 9B Claude 4.6 Opus Reasoning Distilled, and Qwen3.5 4B.
- What is the most popular Qwen 3.5 model to run locally?
- Qwen3.5 122B A10B is the most downloaded Qwen 3.5 model in local-friendly quantized formats. It needs at least 53.5 GB of VRAM.
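As a rough illustration of why VRAM needs grow with model size and shrink with more aggressive quantization, the sketch below estimates memory as weight bytes plus a flat overhead. This is a common rule of thumb, not the method used to produce the figures on this page. The function names and the 0.5 GB overhead allowance are hypothetical, and real usage also depends on context length and runtime.

```python
def estimate_vram_gb(params_billions: float, bits_per_weight: float,
                     overhead_gb: float = 0.5) -> float:
    """Rough VRAM estimate in decimal GB.

    Assumption: memory ~= weight storage + a flat allowance for runtime
    overhead. The 0.5 GB default is a placeholder, not a measured value.
    """
    # 1e9 parameters * (bits / 8) bytes each = that many GB (decimal).
    weight_gb = params_billions * bits_per_weight / 8
    return weight_gb + overhead_gb


def fits_in_vram(params_billions: float, bits_per_weight: float,
                 vram_gb: float, overhead_gb: float = 0.5) -> bool:
    """Return True if the rough estimate is within the given VRAM budget."""
    return estimate_vram_gb(params_billions, bits_per_weight, overhead_gb) <= vram_gb
```

For example, `fits_in_vram(27, 4, 16)` checks whether a 27B-parameter model at about 4 bits per weight would fit a 16 GB card under these assumptions. Treat the result as a first-pass filter and rely on the per-model quantization tables for actual figures.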