Josiefied Qwen3.5 0.8B Gabliterated V1 vs Qwen3.5 9B DFlash

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

Josiefied Qwen3.5 0.8B Gabliterated V1

Goekdeniz-Guelmez · 853M

Chat

Qwen3.5 9B DFlash

z-lab · 1.0B

Chat

Specifications

                Josiefied Qwen3.5 0.8B Gabliterated V1    Qwen3.5 9B DFlash
Parameters      853M                                      1.0B
Context         262K                                      262K
Architecture    Qwen3_5ForConditionalGeneration           DFlashDraftModel
Downloads       619                                       18.8K
Released        Mar 2026                                  Apr 2026

License: MIT

VRAM by Quantization: Josiefied Qwen3.5 0.8B Gabliterated V1 vs Qwen3.5 9B DFlash

Quantization    Bits    Josiefied Qwen3.5 0.8B Gabliterated V1 VRAM    Qwen3.5 9B DFlash VRAM
Q2_K            3.40    0.7 GB                                         -
Q3_K_L          4.10    0.8 GB                                         0.9 GB
Q3_K_M          3.90    0.8 GB                                         -
Q3_K_S          3.50    0.7 GB                                         -
Q4_K_M          4.80    0.9 GB                                         1.0 GB
Q4_K_S          4.50    0.8 GB                                         -
Q5_K_M          5.70    1.0 GB                                         1.1 GB
Q5_K_S          5.50    0.9 GB                                         -
Q6_K            6.60    1.1 GB                                         1.2 GB
Q8_0            8.00    1.2 GB                                         1.4 GB
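
The page does not say how these figures are derived. As a rough cross-check, the weights-only footprint of a quantized model can be estimated as parameters × bits per weight ÷ 8. The sketch below assumes decimal gigabytes and ignores KV cache, activations, and runtime overhead, so it gives a lower bound rather than the listed numbers. `weights_gb` is a hypothetical helper name, not something the page defines.

```python
def weights_gb(params: float, bits_per_weight: float) -> float:
    """Weights-only size in decimal GB: params * bits / 8 bytes."""
    return params * bits_per_weight / 8 / 1e9


# Parameter counts and the Q4_K_M bit width are taken from the page.
josiefied = weights_gb(853e6, 4.80)
dflash = weights_gb(1.0e9, 4.80)

print(f"Josiefied Q4_K_M weights: {josiefied:.2f} GB")
print(f"DFlash Q4_K_M weights:    {dflash:.2f} GB")
```

Both estimates come out below the 0.9 GB and 1.0 GB listed for Q4_K_M, which suggests the listed figures include some overhead beyond the weights, although the page does not state this.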

Verdict

Josiefied Qwen3.5 0.8B Gabliterated V1 needs less VRAM at Q4_K_M (0.9 GB vs 1.0 GB), so it fits on marginally smaller GPUs. The gap is only 0.1 GB at most quantizations listed for both models and 0.2 GB at Q8_0. Qwen3.5 9B DFlash is the more widely downloaded of the two.
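
To turn the comparison into a concrete choice, the sketch below picks the largest quantization that fits a given VRAM budget. It uses only the five quantizations for which the page lists a figure for both models. `VRAM_GB`, `largest_fit`, and the 1.0 GB budget are illustrative assumptions, not part of the page.

```python
from typing import Optional

# VRAM in GB as (Josiefied, DFlash), copied from the quantization listing
# for the rows where both models have a value.
VRAM_GB = {
    "Q3_K_L": (0.8, 0.9),
    "Q4_K_M": (0.9, 1.0),
    "Q5_K_M": (1.0, 1.1),
    "Q6_K": (1.1, 1.2),
    "Q8_0": (1.2, 1.4),
}

JOSIEFIED, DFLASH = 0, 1


def largest_fit(budget_gb: float, model: int) -> Optional[str]:
    """Return the quantization with the highest listed VRAM that fits, or None."""
    fitting = [
        (vram[model], name)
        for name, vram in VRAM_GB.items()
        if vram[model] <= budget_gb
    ]
    return max(fitting)[1] if fitting else None


# Hypothetical 1.0 GB budget.
print(largest_fit(1.0, JOSIEFIED))
print(largest_fit(1.0, DFLASH))
```

With the same budget, the lighter model can move up one quantization step, which is the practical effect of the 0.1 GB difference.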

Frequently Asked Questions

Which needs less VRAM, Josiefied Qwen3.5 0.8B Gabliterated V1 or Qwen3.5 9B DFlash?

At Q4_K_M, Josiefied Qwen3.5 0.8B Gabliterated V1 needs 0.9 GB and Qwen3.5 9B DFlash needs 1.0 GB, so Josiefied Qwen3.5 0.8B Gabliterated V1 is the lighter option to run locally.

Which has a longer context window, Josiefied Qwen3.5 0.8B Gabliterated V1 or Qwen3.5 9B DFlash?

Neither. Josiefied Qwen3.5 0.8B Gabliterated V1 and Qwen3.5 9B DFlash both support 262,144 tokens (262K).

What is the difference between Josiefied Qwen3.5 0.8B Gabliterated V1 and Qwen3.5 9B DFlash?

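Josiefied Qwen3.5 0.8B Gabliterated V1 is an 853M-parameter model from Goekdeniz-Guelmez with the Qwen3_5ForConditionalGeneration architecture, while Qwen3.5 9B DFlash is a 1.0B-parameter model from z-lab with the DFlashDraftModel architecture. Both belong to the Qwen family and share a 262K context window. Compare their VRAM requirements above to see which fits your GPU or Mac.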