Qwen3.6 28B vs Cogito V1 Preview Qwen 32B

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

Qwen3.6 28B

0xSero · 28.2B

Chat
Cogito V1 Preview Qwen 32B

deepcogito · 32B

Chat

Specifications

Qwen3.6 28BCogito V1 Preview Qwen 32B
Parameters28.2B32B
Context262K131K
ArchitectureQwen3_5MoeForCausalLMQwen2ForCausalLM
LicenseApache 2.0Apache 2.0
Downloads1.2K43.2K
ReleasedMay 2026Apr 2025

VRAM by Quantization: Qwen3.6 28B vs Cogito V1 Preview Qwen 32B

QuantizationBitsQwen3.6 28B VRAMCogito V1 Preview Qwen 32B VRAM
Q2_K3.4012.4 GB14.4 GB
Q3_K_M3.9014.2 GB16.4 GB
Q3_K_S3.5012.7 GB14.8 GB
Q4_04.0016.8 GB
Q4_K_M4.8017.3 GB20.0 GB
Q5_K_M5.7020.5 GB23.6 GB
Q6_K6.6023.7 GB27.2 GB
Q8_08.0028.6 GB32.8 GB

Verdict

Qwen3.6 28B needs less VRAM at Q4_K_M (17.3 GB vs 20.0 GB), so it fits on smaller GPUs. Qwen3.6 28B supports a longer context window (262K tokens). Cogito V1 Preview Qwen 32B is the more widely downloaded of the two.

Frequently Asked Questions

Which needs less VRAM, Qwen3.6 28B or Cogito V1 Preview Qwen 32B?

At Q4_K_M, Qwen3.6 28B needs 17.3 GB and Cogito V1 Preview Qwen 32B needs 20.0 GB, so Qwen3.6 28B is the lighter option to run locally.

Which has a longer context window, Qwen3.6 28B or Cogito V1 Preview Qwen 32B?

Qwen3.6 28B supports 262,144 tokens and Cogito V1 Preview Qwen 32B supports 131,072 tokens.

What is the difference between Qwen3.6 28B and Cogito V1 Preview Qwen 32B?

Qwen3.6 28B is a 28.2B model from 0xSero (Qwen family), while Cogito V1 Preview Qwen 32B is a 32B model from deepcogito (Qwen family). Compare their VRAM requirements above to see which fits your GPU or Mac.