Question 1

Which needs less VRAM, Llama 3.1 8B Instruct or Llama 3.1 8B Lexi Uncensored v2?

Accepted Answer

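At Q4_K_M, Llama 3.1 8B Instruct needs about 5.3 GB of VRAM and Llama 3.1 8B Lexi Uncensored v2 needs about 5.4 GB, so Instruct is marginally the lighter option to run locally at that quantization. The gap is only 0.1 GB, and it reverses at Q6_K (7.3 GB vs 7.2 GB) and Q8_0 (8.8 GB vs 8.6 GB), where Lexi Uncensored v2 is slightly smaller.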

Question 2

Which has a longer context window, Llama 3.1 8B Instruct or Llama 3.1 8B Lexi Uncensored v2?

Accepted Answer

Neither. Llama 3.1 8B Instruct and Llama 3.1 8B Lexi Uncensored v2 both support a context window of 131,072 tokens.

Question 3

What is the difference between Llama 3.1 8B Instruct and Llama 3.1 8B Lexi Uncensored v2?

Accepted Answer

Llama 3.1 8B Instruct is an 8.0B model from Meta (Llama 3 family), while Llama 3.1 8B Lexi Uncensored v2 is an 8.0B model from Orenguteng (Llama 3 family). The two share the same parameter count, context window, and license. Compare their VRAM requirements in the quantization table below to see which fits your GPU or Mac.

Specification	Llama 3.1 8B Instruct	Llama 3.1 8B Lexi Uncensored v2
Parameters	8.0B	8.0B
Context	131K	131K
Architecture	—	LlamaForCausalLM
License	Llama 3.1 Community	Llama 3.1 Community
Downloads	11.3M	29.7K
Released	Sep 2024	Sep 2024
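The FAQ above suggests comparing VRAM requirements to see which model fits your GPU or Mac. As a minimal sketch of that check, the snippet below copies the VRAM figures from this page's quantization table and returns the largest quantization that fits a given budget. The names `VRAM_GB`, `MODELS`, and `largest_fit` are hypothetical helpers introduced for illustration, and the budget is whatever memory you are willing to dedicate to the model; actual usage may be higher depending on context length and runtime.

```python
from typing import Optional

# VRAM in GB as (Instruct, Lexi Uncensored v2), copied from the table on this page.
# None marks a quantization with no figure listed.
VRAM_GB = {
    "Q2_K": (3.8, 4.0),
    "Q3_K_S": (3.9, 4.1),
    "Q3_K_M": (4.3, 4.5),
    "Q4_0": (4.4, None),
    "Q4_K_M": (5.3, 5.4),
    "Q5_K_M": (6.3, 6.3),
    "Q6_K": (7.3, 7.2),
    "Q8_0": (8.8, 8.6),
}
MODELS = ("Llama 3.1 8B Instruct", "Llama 3.1 8B Lexi Uncensored v2")


def largest_fit(model: str, budget_gb: float) -> Optional[str]:
    """Return the quantization with the highest listed VRAM that still fits the budget."""
    col = MODELS.index(model)
    fitting = [
        (row[col], quant)
        for quant, row in VRAM_GB.items()
        if row[col] is not None and row[col] <= budget_gb
    ]
    # max() compares the VRAM figure first, so the largest fitting entry wins.
    return max(fitting)[1] if fitting else None


if __name__ == "__main__":
    for name in MODELS:
        print(name, "->", largest_fit(name, 6.0))
```

With a 6.0 GB budget, both models top out at Q4_K_M according to the listed figures; a budget below 3.8 GB returns `None` for both.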

Quantization	Bits per weight	Llama 3.1 8B Instruct VRAM	Llama 3.1 8B Lexi Uncensored v2 VRAM
Q2_K	3.40	3.8 GB	4.0 GB
Q3_K_S	3.50	3.9 GB	4.1 GB
Q3_K_M	3.90	4.3 GB	4.5 GB
Q4_0	4.00	4.4 GB	—
Q4_K_M	4.80	5.3 GB	5.4 GB
Q5_K_M	5.70	6.3 GB	6.3 GB
Q6_K	6.60	7.3 GB	7.2 GB
Q8_0	8.00	8.8 GB	8.6 GB
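The relationship between the bits column and the VRAM columns can be approximated with simple arithmetic: parameters times bits per weight, divided by 8 bits per byte, plus some runtime overhead. The sketch below assumes a flat 10% overhead, which is a guess and not a method stated on this page; with that assumption the result lands within about 0.1 GB of the Llama 3.1 8B Instruct column, while the Lexi Uncensored v2 figures deviate a little more. The function name `estimate_vram_gb` and its `overhead` parameter are hypothetical.

```python
def estimate_vram_gb(params_billion: float, bits_per_weight: float,
                     overhead: float = 1.10) -> float:
    """Rough VRAM estimate in GB for a quantized model.

    overhead=1.10 is an assumed 10% allowance for buffers and cache,
    not a figure taken from this page.
    """
    # 1e9 parameters * bits per weight / 8 bits per byte = size in GB (decimal).
    weights_gb = params_billion * bits_per_weight / 8
    return weights_gb * overhead


if __name__ == "__main__":
    # Q4_K_M at 4.80 bits per weight for an 8.0B model.
    print(f"{estimate_vram_gb(8.0, 4.80):.2f} GB")  # prints "5.28 GB"
```

Treat the output as a ballpark only: long contexts need additional memory for the KV cache, so the real footprint can exceed this estimate.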
