PrunedHub Qwen3.5 35B A3B 80pct vs Qwen3.5 35B A3B DFlash

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

PrunedHub Qwen3.5 35B A3B 80pct: GOBA-AI-Labs · 35B · Chat

Qwen3.5 35B A3B DFlash: z-lab · 35B · Chat

Specifications

Specification | PrunedHub Qwen3.5 35B A3B 80pct | Qwen3.5 35B A3B DFlash
Parameters | 35B | 35B
Context | 262K
Architecture | DFlashDraftModel
License | Apache 2.0 | MIT
Downloads | 578 | 705
Released | Feb 2026 | Mar 2026

VRAM by Quantization: PrunedHub Qwen3.5 35B A3B 80pct vs Qwen3.5 35B A3B DFlash

Quantization | Bits | PrunedHub Qwen3.5 35B A3B 80pct VRAM | Qwen3.5 35B A3B DFlash VRAM
BF16 | 16.00 | 77 GB | 70.3 GB
Q2_K | 3.40 | 16.4 GB | 15.2 GB
Q3_K_M | 3.90 | 18.8 GB | 17.4 GB
Q4_K_M | 4.80 | 23.1 GB | 21.3 GB
Q5_K_M | 5.70 | 27.4 GB | 25.3 GB
Q6_K | 6.60 | 31.8 GB | 29.2 GB
Q8_0 | 8.00 | 38.5 GB | 35.3 GB
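
As a rough sanity check on these figures, the weights alone occupy about parameters × bits per weight / 8 bytes. The sketch below computes that weights-only floor for each listed quantization. It assumes 1 GB = 10^9 bytes and uses a hypothetical helper name, weights_only_gb; the page does not state how its VRAM figures were derived, so any gap between this floor and the listed values is presumably runtime overhead, not something this sketch reproduces.

```python
# Weights-only lower bound: parameters x bits-per-weight / 8 bytes.
# Assumption: 1 GB = 1e9 bytes. KV cache and runtime overhead are ignored.

PARAMS = 35e9  # parameter count listed for both models

# Effective bits per weight, copied from the table above.
BITS_PER_WEIGHT = {
    "BF16": 16.00,
    "Q2_K": 3.40,
    "Q3_K_M": 3.90,
    "Q4_K_M": 4.80,
    "Q5_K_M": 5.70,
    "Q6_K": 6.60,
    "Q8_0": 8.00,
}


def weights_only_gb(params: float, bits: float) -> float:
    """Return the size of the weights alone, in GB (hypothetical helper)."""
    return params * bits / 8 / 1e9


if __name__ == "__main__":
    for name, bits in BITS_PER_WEIGHT.items():
        print(f"{name:7s} {weights_only_gb(PARAMS, bits):6.2f} GB")
```

At Q4_K_M this gives 21.0 GB, just under both listed figures (21.3 GB and 23.1 GB), which is consistent with the table adding some overhead on top of the raw weights.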

Verdict

Qwen3.5 35B A3B DFlash needs less VRAM at every listed quantization, for example 21.3 GB vs 23.1 GB at Q4_K_M, so it fits on smaller GPUs. It is also the more widely downloaded of the two (705 vs 578 downloads).

Frequently Asked Questions

Which needs less VRAM, PrunedHub Qwen3.5 35B A3B 80pct or Qwen3.5 35B A3B DFlash?

At Q4_K_M, PrunedHub Qwen3.5 35B A3B 80pct needs 23.1 GB and Qwen3.5 35B A3B DFlash needs 21.3 GB, so Qwen3.5 35B A3B DFlash is the lighter option to run locally.

What is the difference between PrunedHub Qwen3.5 35B A3B 80pct and Qwen3.5 35B A3B DFlash?

Both are 35B models in the Qwen family. PrunedHub Qwen3.5 35B A3B 80pct comes from GOBA-AI-Labs, is licensed under Apache 2.0, and was released in Feb 2026; Qwen3.5 35B A3B DFlash comes from z-lab, is MIT-licensed, and was released in Mar 2026. Compare their VRAM requirements above to see which fits your GPU or Mac.
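
To check which quantization fits a given GPU or Mac, one option is to look up the largest listed quantization whose VRAM figure stays within your memory budget. The sketch below does that with the values from the VRAM table. The function name best_fit and the budget parameter are illustrative assumptions, and the comparison treats the listed figure as the whole requirement, so in practice you may want to leave extra headroom for context.

```python
from typing import Optional

# VRAM figures in GB, copied from the comparison table on this page.
VRAM_GB = {
    "PrunedHub Qwen3.5 35B A3B 80pct": {
        "BF16": 77.0, "Q2_K": 16.4, "Q3_K_M": 18.8, "Q4_K_M": 23.1,
        "Q5_K_M": 27.4, "Q6_K": 31.8, "Q8_0": 38.5,
    },
    "Qwen3.5 35B A3B DFlash": {
        "BF16": 70.3, "Q2_K": 15.2, "Q3_K_M": 17.4, "Q4_K_M": 21.3,
        "Q5_K_M": 25.3, "Q6_K": 29.2, "Q8_0": 35.3,
    },
}


def best_fit(model: str, budget_gb: float) -> Optional[str]:
    """Return the largest listed quantization that fits in budget_gb.

    Hypothetical helper: returns None when even the smallest
    quantization exceeds the budget.
    """
    fitting = {q: gb for q, gb in VRAM_GB[model].items() if gb <= budget_gb}
    if not fitting:
        return None
    # The quantization with the highest VRAM figure that still fits.
    return max(fitting, key=fitting.get)


if __name__ == "__main__":
    for model in VRAM_GB:
        print(model, "->", best_fit(model, 24.0))
```

With a 22 GB budget, for instance, the lookup selects Q4_K_M for Qwen3.5 35B A3B DFlash but only Q3_K_M for PrunedHub Qwen3.5 35B A3B 80pct, which is the practical effect of the 1.8 GB gap at Q4_K_M.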