Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill vs CodeQwen1.5 7B

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill

Jackrong · 9.7B

Chat · Reasoning

CodeQwen1.5 7B

Alibaba · 7.3B

Chat · Code

Specifications

Specification | Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill | CodeQwen1.5 7B
Parameters | 9.7B | 7.3B
Context | 262K | 66K
Architecture | Qwen3_5ForConditionalGeneration | Qwen2ForCausalLM
License | Apache 2.0 | Other
Downloads | 499 | 1.6K
Released | Mar 2026 | (not listed)

VRAM by Quantization: Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill vs CodeQwen1.5 7B

Quantization | Bits per weight | Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill VRAM | CodeQwen1.5 7B VRAM
Q2_K | 3.40 | 4.7 GB | 3.5 GB
Q3_K_M | 3.90 | 5.3 GB | 4.0 GB
Q3_K_S | 3.50 | 4.8 GB | 3.6 GB
Q4_0 | 4.00 | (not listed) | 4.1 GB
Q4_K_M | 4.80 | 6.4 GB | 4.8 GB
Q5_K_M | 5.70 | 7.5 GB | 5.6 GB
Q6_K | 6.60 | 8.5 GB | 6.4 GB
Q8_0 | 8.00 | 10.2 GB | 7.7 GB
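
The page does not say how these VRAM figures are computed. As a rough cross-check, the sketch below estimates the size of the quantized weights alone from the parameter count and the Bits column, assuming that column is the average bits per weight. The function names and the `overhead_gb` parameter are hypothetical, introduced only for illustration.

```python
def weights_only_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights alone, in GB (10^9 bytes)."""
    # billions of parameters * bits each / 8 bits per byte = billions of bytes
    return params_billions * bits_per_weight / 8


def estimate_vram_gb(params_billions: float, bits_per_weight: float,
                     overhead_gb: float = 0.0) -> float:
    """Weights plus a caller-supplied allowance for runtime overhead.

    overhead_gb is an assumption (KV cache, buffers); the source page
    does not state what its own figures include.
    """
    return weights_only_gb(params_billions, bits_per_weight) + overhead_gb


if __name__ == "__main__":
    # Parameter counts and Q4_K_M bits (4.80) are taken from the tables above.
    for name, params in (("Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill", 9.7),
                         ("CodeQwen1.5 7B", 7.3)):
        print(f"{name}: {weights_only_gb(params, 4.8):.2f} GB (weights only)")
```

At Q4_K_M this gives roughly 5.8 GB and 4.4 GB, somewhat below the 6.4 GB and 4.8 GB listed. The gap is presumably runtime overhead, but that is an inference rather than something the page states.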

Verdict

CodeQwen1.5 7B needs less VRAM at Q4_K_M (4.8 GB vs 6.4 GB), so it fits on smaller GPUs. Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill supports a longer context window (262K vs 66K tokens). CodeQwen1.5 7B is the more widely downloaded of the two (1.6K vs 499 downloads).

Frequently Asked Questions

Which needs less VRAM, Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill or CodeQwen1.5 7B?

At Q4_K_M, Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill needs 6.4 GB and CodeQwen1.5 7B needs 4.8 GB, so CodeQwen1.5 7B is the lighter option to run locally.

Which has a longer context window, Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill or CodeQwen1.5 7B?

Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill supports 262,144 tokens and CodeQwen1.5 7B supports 65,536 tokens, so Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill has the longer context window, four times the size.

What is the difference between Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill and CodeQwen1.5 7B?

Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill is a 9.7B model from Jackrong (Qwen family), while CodeQwen1.5 7B is a 7.3B model from Alibaba (Qwen family). Compare their VRAM requirements above to see which fits your GPU or Mac.
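
One way to make that comparison concrete is a small lookup over the VRAM figures above. The sketch below picks the largest listed quantization that fits a given memory budget. The `VRAM_GB` table and `best_quant` helper are hypothetical names. The single Q4_0 figure (4.1 GB) is assumed to belong to CodeQwen1.5 7B, since it falls between that model's Q3_K_M and Q4_K_M values.

```python
# VRAM figures in GB, copied from the quantization table above.
# Q4_0 is omitted for the Qwen3.5 distill because the source lists
# only one value for that row (assumed here to be CodeQwen1.5 7B's).
VRAM_GB = {
    "Qwen3.5 9B Gemini 3.1 Pro Reasoning Distill": {
        "Q2_K": 4.7, "Q3_K_S": 4.8, "Q3_K_M": 5.3, "Q4_K_M": 6.4,
        "Q5_K_M": 7.5, "Q6_K": 8.5, "Q8_0": 10.2,
    },
    "CodeQwen1.5 7B": {
        "Q2_K": 3.5, "Q3_K_S": 3.6, "Q3_K_M": 4.0, "Q4_0": 4.1,
        "Q4_K_M": 4.8, "Q5_K_M": 5.6, "Q6_K": 6.4, "Q8_0": 7.7,
    },
}


def best_quant(model: str, budget_gb: float):
    """Return (quantization, vram_gb) for the largest listed footprint
    that fits within budget_gb, or None if nothing fits."""
    fitting = [(gb, quant) for quant, gb in VRAM_GB[model].items()
               if gb <= budget_gb]
    if not fitting:
        return None
    gb, quant = max(fitting)  # largest footprint that still fits
    return quant, gb


if __name__ == "__main__":
    for model in VRAM_GB:
        print(model, "->", best_quant(model, 8.0))
```

With an 8 GB budget this selects Q5_K_M (7.5 GB) for the Qwen3.5 distill and Q8_0 (7.7 GB) for CodeQwen1.5 7B. In practice it is probably wise to leave some headroom below the card's nominal capacity, since the page does not say what overhead its figures already include.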