Phi 3.5 MoE Instruct vs Phi 3 Small 8k Instruct

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

Phi 3.5 MoE Instruct

Microsoft · 41.9B

Chat · Code

Phi 3 Small 8k Instruct

Microsoft · 7.4B

Chat · Code

Specifications

                Phi 3.5 MoE Instruct    Phi 3 Small 8k Instruct
Parameters      41.9B                   7.4B
Context         131K                    8K
Architecture    PhiMoEForCausalLM       Phi3SmallForCausalLM
License         MIT                     MIT
Downloads       138.1K                  17.2K

VRAM by Quantization: Phi 3.5 MoE Instruct vs Phi 3 Small 8k Instruct

Quantization    Bits     Phi 3.5 MoE Instruct VRAM    Phi 3 Small 8k Instruct VRAM
BF16            16.00    84.3 GB                      15.3 GB
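
As a rough sanity check on the BF16 row, the weights-only footprint can be estimated from the parameter count and the bits per weight. The sketch below is a minimal illustration, not the method this page uses: the function name `weight_memory_gb` is hypothetical, and it assumes decimal gigabytes (1 GB = 10^9 bytes) and ignores activations, KV cache, and runtime overhead.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Estimate weights-only memory in decimal GB (1 GB = 1e9 bytes).

    Assumption: every parameter is stored at the same precision and
    nothing else (KV cache, activations, runtime buffers) is counted.
    """
    # billions of parameters * bytes per parameter = billions of bytes = GB
    return params_billion * bits_per_weight / 8


# Parameter counts and bit width are taken from the tables above.
models = {
    "Phi 3.5 MoE Instruct": 41.9,
    "Phi 3 Small 8k Instruct": 7.4,
}

for name, params_billion in models.items():
    estimate = weight_memory_gb(params_billion, 16)
    print(f"{name}: about {estimate:.1f} GB of weights at BF16")
```

The estimates land slightly below the listed figures (84.3 GB and 15.3 GB). The gap may reflect an overhead allowance in the page's numbers, but that is a guess; the source does not say how its figures are computed.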

Verdict

Phi 3 Small 8k Instruct needs far less VRAM at BF16 (15.3 GB vs 84.3 GB), so it fits on smaller GPUs. Phi 3.5 MoE Instruct supports a much longer context window (131K vs 8K tokens) and is the more widely downloaded of the two (138.1K vs 17.2K downloads).
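
To make "fits on smaller GPUs" concrete, the listed BF16 figures can be compared against a few common single-device memory sizes. This is a minimal sketch under stated assumptions: the capacity list is illustrative and not from this page, the helper `smallest_fit` is hypothetical, and the check does not reserve any headroom beyond the listed requirement.

```python
from typing import Iterable, Optional

# BF16 VRAM requirements in GB, taken from the table above.
REQUIRED_GB = {
    "Phi 3.5 MoE Instruct": 84.3,
    "Phi 3 Small 8k Instruct": 15.3,
}

# Illustrative single-device memory sizes in GB (an assumption, not from the source).
DEVICE_CAPACITY_GB = [16, 24, 48, 80]


def smallest_fit(required_gb: float, capacities: Iterable[float]) -> Optional[float]:
    """Return the smallest capacity that is >= required_gb, or None if none fits."""
    for capacity in sorted(capacities):
        if capacity >= required_gb:
            return capacity
    return None


for name, required in REQUIRED_GB.items():
    fit = smallest_fit(required, DEVICE_CAPACITY_GB)
    if fit is None:
        print(f"{name}: {required} GB exceeds every listed single device")
    else:
        print(f"{name}: {required} GB fits on a {fit} GB device")
```

Under these assumptions, Phi 3 Small 8k Instruct fits within 16 GB with very little room to spare, while Phi 3.5 MoE Instruct at BF16 exceeds even an 80 GB device and would need several devices or a lower-precision quantization, which this page does not list.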

Frequently Asked Questions

Which needs less VRAM, Phi 3.5 MoE Instruct or Phi 3 Small 8k Instruct?

At BF16, Phi 3.5 MoE Instruct needs 84.3 GB and Phi 3 Small 8k Instruct needs 15.3 GB, so Phi 3 Small 8k Instruct is the lighter option to run locally.

Which has a longer context window, Phi 3.5 MoE Instruct or Phi 3 Small 8k Instruct?

Phi 3.5 MoE Instruct supports 131,072 tokens and Phi 3 Small 8k Instruct supports 8,192 tokens.

What is the difference between Phi 3.5 MoE Instruct and Phi 3 Small 8k Instruct?

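Both models come from Microsoft's Phi 3 family and are MIT-licensed. Phi 3.5 MoE Instruct is a 41.9B-parameter mixture-of-experts model (PhiMoEForCausalLM architecture) with a 131K-token context window, while Phi 3 Small 8k Instruct is a 7.4B-parameter model (Phi3SmallForCausalLM architecture) with an 8K-token context window. Compare their VRAM requirements above to see which fits your GPU or Mac.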