Yi 9B vs Internlm3 8B Instruct

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

Yi 9B

01.AI · 8.8B

Chat
Internlm3 8B Instruct

InternLM · 8.8B

Chat

Specifications

Yi 9BInternlm3 8B Instruct
Parameters8.8B8.8B
Context4K33K
ArchitectureLlamaForCausalLMInternLM3ForCausalLM
LicenseApache 2.0Apache 2.0
Downloads8.7K89.5K
ReleasedFeb 2025

VRAM by Quantization: Yi 9B vs Internlm3 8B Instruct

QuantizationBitsYi 9B VRAMInternlm3 8B Instruct VRAM
Q2_K3.404.3 GB4.1 GB
Q3_K_M3.904.8 GB4.7 GB
Q3_K_S3.504.4 GB4.3 GB
Q4_04.004.8 GB
Q4_K_M4.805.8 GB5.7 GB
Q5_K_M5.706.8 GB6.7 GB
Q6_K6.607.8 GB7.7 GB
Q8_08.009.3 GB9.2 GB

Verdict

Internlm3 8B Instruct needs less VRAM at Q4_K_M (5.7 GB vs 5.8 GB), so it fits on smaller GPUs. Internlm3 8B Instruct supports a longer context window (33K tokens). Internlm3 8B Instruct is the more widely downloaded of the two.

Frequently Asked Questions

Which needs less VRAM, Yi 9B or Internlm3 8B Instruct?

At Q4_K_M, Yi 9B needs 5.8 GB and Internlm3 8B Instruct needs 5.7 GB, so Internlm3 8B Instruct is the lighter option to run locally.

Which has a longer context window, Yi 9B or Internlm3 8B Instruct?

Yi 9B supports 4,096 tokens and Internlm3 8B Instruct supports 32,768 tokens.

What is the difference between Yi 9B and Internlm3 8B Instruct?

Yi 9B is a 8.8B model from 01.AI (Yi family), while Internlm3 8B Instruct is a 8.8B model from InternLM (InternLM family). Compare their VRAM requirements above to see which fits your GPU or Mac.