Qwen 3 Models — Hardware Requirements

48 Qwen 3 models from Alibaba and the community, from the smallest that runs in 0.3 GB of VRAM up to 480.2B parameters. Every row links to full quantization tables and GPU compatibility.

All Qwen 3 Models by Size

ModelParamsContext
Qwen3 4B Domino B16588M41K
Qwen3 0.6B Base596M33K
Qwen3 0.6B Heretic Abliterated Uncensored596M41K
Distil Qwen3 0.6B Text2sql596M41K
Qwen3 0.6B0.6B
Qwen3 0.6B752M41K
Qwen3 14B PARO1.6B41K
Qwen3 1.7B Abliterated1.7B
Qwen3 1.7B1.7B
Qwen3 1.7B Base1.7B33K
Qwen3 1.7B2.0B41K
Qwen3 4B Z Image Engineer V44B
Qwen3 4B Gemini 3.1 Pro Reasoning Distilled4B262K
Qwen3 Code Reasoning 4B4B262K
Qwen3 4B4.0B41K
Qwen3 4B Instruct 25074.0B262K
Qwen3 4B Thinking 25074.0B262K
Qwen3 4B Base4.0B33K
Qwen3 4B Instruct 2507 Heretic4.0B262K
Huihui Qwen3 4B Abliterated v24.0B41K
Qwen3 4B Heretic4.0B41K
Qwen3 4B Abliterated4.0B
Qwen3 4B Hindi Instruct v24.0B262K
Qwen3 8B8B
Qwen3 8B8.2B41K
Qwen3 8B Base8.2B33K
Qwen3Guard Gen 8B8.2B33K
Huihui Qwen3 8B Abliterated v28.2B41K
Josiefied Qwen3 8B Abliterated V18.2B41K
Qwen3 8B Abliterated8.2B
Qwen3 14B14.8B41K
Qwen3 14B Base14.8B33K
Qwen3 30B A3B Instruct 250730.5B262K
Qwen3 Coder 30B A3B Instruct30.5B262K
Qwen3 30B A3B30.5B41K
Huihui Qwen3 Coder 30B A3B Instruct Abliterated30.5B262K
Qwen3 30B A3B Thinking 250730.5B262K
Qwen3 30B A3B Base30.5B33K
Qwen3 32B32.8B41K
Qwen3 42B A3B 2507 Thinking Abliterated Uncensored TOTAL RECALL v2 Medium MASTER CODER42.4B262K
Qwen3 Coder Next79.7B262K
Qwen3 Next 80B A3B Thinking81.3B262K
Qwen3 Next 80B A3B Instruct81.3B262K
Qwen3 235B A22B Thinking 2507235.1B262K
Qwen3 235B A22B235.1B41K
Qwen3 235B A22B Instruct 2507235.1B262K
Qwen3 Nemotron 235B A22B GenRM 2603235.1B262K
Qwen3 Coder 480B A35B Instruct480.2B262K

How Qwen 3 Compares — Benchmark Rating

Qwen3 Next 80B A3B Instruct is the highest-rated Qwen 3 model with an overall benchmark rating of 64.4/100 — #9 among 75 open models. The top proprietary model, GPT 5.5, scores 88.8. Click a model to see its full benchmark breakdown.

GPT 5.5 · proprietary88.8
Claude Opus 4.7 · proprietary87.6
Claude Fable 5 · proprietary86.6
GPT 5.4 · proprietary86.6
Claude Opus 4.8 · proprietary84.4
Composite of normalized public benchmark scores (methodology) · Qwen 3 · other models

Frequently Asked Questions

How much VRAM do I need to run a Qwen 3 model?
The smallest Qwen 3 model, Qwen3 0.6B, runs from 0.3 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.
Which Qwen 3 models can I run on a 16 GB GPU?
39 of 48 Qwen 3 models fit in 16 GB of VRAM at some quantization, including Qwen3 14B, Qwen3 30B A3B Instruct 2507, Qwen3 32B.
What is the most popular Qwen 3 model to run locally?
Qwen3 Coder Next is the most downloaded Qwen 3 model in local-friendly quantized formats. It runs from 22.3 GB of VRAM.
How do Qwen 3 models score on benchmarks?
Qwen3 Next 80B A3B Instruct leads the family with an overall benchmark rating of 64.4/100, ranking #9 among 75 open models, while the top proprietary model, GPT 5.5, scores 88.8. See the comparison chart above for the full standings.