Question 1

Which needs less VRAM, Yi 34B Chat or Yi 6B Chat?

Accepted Answer

At Q4_K_M, Yi 34B Chat needs 21.4 GB and Yi 6B Chat needs 4.1 GB, so Yi 6B Chat is the lighter option to run locally.

Question 2

Which has a longer context window, Yi 34B Chat or Yi 6B Chat?

Accepted Answer

Yi 34B Chat supports 4,096 tokens and Yi 6B Chat supports 4,096 tokens.

Question 3

What is the difference between Yi 34B Chat and Yi 6B Chat?

Accepted Answer

Yi 34B Chat is a 34.4B model from 01.AI (Yi family), while Yi 6B Chat is a 6.1B model from 01.AI (Yi family). Compare their VRAM requirements above to see which fits your GPU or Mac.

	Yi 34B Chat	Yi 6B Chat
Parameters	34.4B	6.1B
Context	4K	4K
Architecture	LlamaForCausalLM	LlamaForCausalLM
License	Apache 2.0	Apache 2.0
Downloads	75.9K	65.1K
Released	—	—

Quantization	Bits	Yi 34B Chat VRAM	Yi 6B Chat VRAM
Q2_K	3.40	15.4 GB	3.0 GB
Q3_K_M	3.90	17.6 GB	3.4 GB
Q3_K_S	3.50	15.8 GB	3.1 GB
Q4_0	4.00	18 GB	3.5 GB
Q4_K_M	4.80	21.4 GB	4.1 GB
Q5_K_M	5.70	25.3 GB	4.8 GB
Q6_K	6.60	29.2 GB	5.4 GB
Q8_0	8.00	35.2 GB	6.5 GB

Yi 34B Chat vs Yi 6B Chat

Specifications

VRAM by Quantization: Yi 34B Chat vs Yi 6B Chat

Verdict

Frequently Asked Questions