Question 1

Which needs less VRAM, Yi 34B Chat or Yi 9B?

Accepted Answer

At Q4_K_M, Yi 34B Chat needs 21.4 GB and Yi 9B needs 5.8 GB, so Yi 9B is the lighter option to run locally.

Question 2

Which has a longer context window, Yi 34B Chat or Yi 9B?

Accepted Answer

Neither: Yi 34B Chat and Yi 9B both support a 4,096-token context window.

Question 3

What is the difference between Yi 34B Chat and Yi 9B?

Accepted Answer

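Yi 34B Chat is a 34.4B-parameter model from 01.AI (Yi family), while Yi 9B is an 8.8B-parameter model from the same family. Both use the LlamaForCausalLM architecture, a 4K context window, and the Apache 2.0 license, so the practical difference is size: compare their VRAM requirements in the quantization table to see which fits your GPU or Mac.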
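One way to make that comparison concrete is to filter the listed VRAM figures by a memory budget. The following is a minimal sketch, not part of any tool: the `VRAM_GB` dictionary copies the figures from this page's quantization table, while the `fits` helper and its 10% headroom default are hypothetical choices made for illustration.

```python
# VRAM figures in GB, copied from the quantization table on this page.
# None marks a quantization for which the page lists no figure.
VRAM_GB = {
    "Yi 34B Chat": {
        "Q2_K": 15.4, "Q3_K_M": 17.6, "Q3_K_S": 15.8, "Q4_0": 18.0,
        "Q4_K_M": 21.4, "Q5_K_M": 25.3, "Q6_K": 29.2, "Q8_0": 35.2,
    },
    "Yi 9B": {
        "Q2_K": 4.3, "Q3_K_M": 4.8, "Q3_K_S": 4.4, "Q4_0": None,
        "Q4_K_M": 5.8, "Q5_K_M": 6.8, "Q6_K": 7.8, "Q8_0": 9.3,
    },
}


def fits(model, budget_gb, headroom=0.10):
    """Return the quantizations whose listed VRAM fits within budget_gb.

    `headroom` is an assumed safety margin: that fraction of the budget
    is kept free for the OS, display, and other processes.
    """
    usable = budget_gb * (1.0 - headroom)
    return [
        quant
        for quant, gb in VRAM_GB[model].items()
        if gb is not None and gb <= usable
    ]


if __name__ == "__main__":
    # Example: a 24 GB GPU.
    for model in VRAM_GB:
        print(model, fits(model, budget_gb=24))
```

With a 24 GB budget and the assumed 10% headroom, Yi 34B Chat fits only up to Q4_K_M, while every listed Yi 9B quantization fits.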

Specification	Yi 34B Chat	Yi 9B
Parameters	34.4B	8.8B
Context	4K	4K
Architecture	LlamaForCausalLM	LlamaForCausalLM
License	Apache 2.0	Apache 2.0
Downloads	75.9K	8.7K
Released	—	—

Quantization	Bits per weight	Yi 34B Chat VRAM	Yi 9B VRAM
Q2_K	3.40	15.4 GB	4.3 GB
Q3_K_M	3.90	17.6 GB	4.8 GB
Q3_K_S	3.50	15.8 GB	4.4 GB
Q4_0	4.00	18.0 GB	—
Q4_K_M	4.80	21.4 GB	5.8 GB
Q5_K_M	5.70	25.3 GB	6.8 GB
Q6_K	6.60	29.2 GB	7.8 GB
Q8_0	8.00	35.2 GB	9.3 GB
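The VRAM figures track a simple piece of arithmetic: parameter count times bits per weight, divided by 8 to get bytes. The page does not state how its numbers were computed, so the sketch below is only an approximation under that assumption. The `weight_gb` helper is hypothetical and uses 1 GB = 10^9 bytes.

```python
def weight_gb(params_billions, bits_per_weight):
    """Weight-only size in GB (1 GB = 1e9 bytes): parameters x bits / 8."""
    return params_billions * bits_per_weight / 8


# Parameter counts, bit widths, and listed VRAM come from the tables on this page.
Q4_K_M_BITS = 4.8
for name, params, listed in [("Yi 34B Chat", 34.4, 21.4), ("Yi 9B", 8.8, 5.8)]:
    est = weight_gb(params, Q4_K_M_BITS)
    print(f"{name}: weights ~{est:.2f} GB, listed {listed} GB, "
          f"gap ~{listed - est:.2f} GB")
```

Across the table, the listed figures sit roughly 0.8 GB (Yi 34B Chat) and 0.5 GB (Yi 9B) above this weight-only estimate. That gap is presumably runtime overhead such as the KV cache and working buffers, though the page does not confirm this.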
