Alibaba·Qwen·Qwen3MoeForCausalLM

Qwen3 Coder 480B A35B Instruct — Hardware Requirements & GPU Compatibility

ChatCode

Qwen3 Coder 480B A35B Instruct is Alibaba's largest code-specialized model, a massive 480.2-billion-parameter mixture-of-experts system with roughly 35 billion parameters active per token. This is the most powerful open-weight coding model in the Qwen3 family, designed for professional-grade code generation, analysis, and software engineering tasks. Running this model locally is a serious undertaking that requires multi-GPU server-class hardware with several hundred gigabytes of combined VRAM. For users with access to such infrastructure, it offers exceptional code quality and understanding that rivals leading proprietary coding assistants, all while keeping data and computation entirely under local control.

76.6K downloads 1.3K likesAug 2025262K context

Specifications

Publisher
Alibaba
Family
Qwen
Parameters
480.2B
Architecture
Qwen3MoeForCausalLM
Context Length
262,144 tokens
Vocabulary Size
151,936
Release Date
2025-08-21
License
Apache 2.0

Get Started

How Much VRAM Does Qwen3 Coder 480B A35B Instruct Need?

Select a quantization to see compatible GPUs below.

QuantizationBitsVRAM
IQ2_XS2.40144.6 GB
IQ2_S2.50150.6 GB
IQ3_XXS3.10186.6 GB
IQ3_XS3.30198.6 GB
Q2_K3.40204.6 GB
Q3_K_S3.50210.6 GB
IQ3_M3.60216.6 GB
Q3_K_M3.90234.6 GB
Q4_04.00240.6 GB
Q3_K_L4.10246.6 GB
IQ4_XS4.30258.6 GB
Q4_14.50270.6 GB
Q4_K_S4.50270.6 GB
IQ4_NL4.50270.6 GB
Q4_K_M4.80288.6 GB
Q5_K_S5.50330.7 GB
Q5_K_M5.70342.7 GB
Q6_K6.60396.7 GB
Q8_08.00480.7 GB

Which GPUs Can Run Qwen3 Coder 480B A35B Instruct?

Q4_K_M · 288.6 GB

Qwen3 Coder 480B A35B Instruct (Q4_K_M) requires 288.6 GB of VRAM to load the model weights. For comfortable inference with headroom for KV cache and system overhead, 376+ GB is recommended. Using the full 262K context window can add up to 33.0 GB, bringing total usage to 321.7 GB. No single GPU has enough memory — multi-GPU or cluster setups are needed.

Which Devices Can Run Qwen3 Coder 480B A35B Instruct?

Q4_K_M · 288.6 GB

2 devices with unified memory can run Qwen3 Coder 480B A35B Instruct, including NVIDIA DGX H100, NVIDIA DGX A100 640GB.

Related Models

Frequently Asked Questions

How much VRAM does Qwen3 Coder 480B A35B Instruct need?

Qwen3 Coder 480B A35B Instruct requires 288.6 GB of VRAM at Q4_K_M, or 480.7 GB at Q8_0. Full 262K context adds up to 33.0 GB (321.7 GB total).

VRAM = Weights + KV Cache + Overhead

Weights = 480.2B × 4.8 bits ÷ 8 = 288.1 GB

KV Cache + Overhead 0.5 GB (at 2K context + ~0.3 GB framework)

KV Cache + Overhead 33.6 GB (at full 262K context)

VRAM usage by quantization

288.6 GB
321.7 GB

Learn more about VRAM estimation →

Can NVIDIA GeForce RTX 5090 run Qwen3 Coder 480B A35B Instruct?

No — Qwen3 Coder 480B A35B Instruct requires at least 144.6 GB at IQ2_XS, which exceeds the NVIDIA GeForce RTX 5090's 32 GB of VRAM.

What's the best quantization for Qwen3 Coder 480B A35B Instruct?

For Qwen3 Coder 480B A35B Instruct, Q4_K_M (288.6 GB) offers the best balance of quality and VRAM usage. Q5_K_S (330.7 GB) provides better quality if you have the VRAM. The smallest option is IQ2_XS at 144.6 GB.

VRAM requirement by quantization

IQ2_XS
144.6 GB
Q3_K_S
210.6 GB
Q3_K_L
246.6 GB
Q4_K_M
288.6 GB
Q5_K_S
330.7 GB
Q8_0
480.7 GB

★ Recommended — best balance of quality and VRAM usage.

Learn more about quantization →

Can I run Qwen3 Coder 480B A35B Instruct on a Mac?

Qwen3 Coder 480B A35B Instruct requires at least 144.6 GB at IQ2_XS, which exceeds the unified memory of most consumer Macs. You would need a Mac Studio or Mac Pro with a high-memory configuration.

Can I run Qwen3 Coder 480B A35B Instruct locally?

Yes — Qwen3 Coder 480B A35B Instruct can run locally on consumer hardware. At Q4_K_M quantization it needs 288.6 GB of VRAM. Popular tools include Ollama, LM Studio, and llama.cpp.

What's the download size of Qwen3 Coder 480B A35B Instruct?

At Q4_K_M, the download is about 288.09 GB. The full-precision Q8_0 version is 480.15 GB. The smallest option (IQ2_XS) is 144.05 GB.