Question 1

Which needs less VRAM, Llama2 7B Chat Uncensored or Llama 2 70B HF?

Accepted Answer

At BF16, Llama2 7B Chat Uncensored needs about 14.8 GB of VRAM and Llama 2 70B HF needs about 151.8 GB, so the 7B model is by far the lighter option to run locally.
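
The figures above can be roughly reproduced with a back-of-the-envelope calculation. This is a minimal sketch, not the method the site actually uses: it assumes BF16 stores 2 bytes per parameter and adds a guessed 10% overhead for runtime buffers. The function name `estimate_vram_gb` and the `overhead` factor are illustrative assumptions; the parameter counts are the ones given in this comparison.

```python
def estimate_vram_gb(params_billions, bytes_per_param=2.0, overhead=1.10):
    """Rough VRAM estimate in GB.

    Weights only (parameters x bytes per parameter), scaled by an
    assumed overhead factor. The 1.10 default is a guess, not a
    measured value.
    """
    return params_billions * bytes_per_param * overhead


# Parameter counts (in billions) as listed in this comparison.
models = [
    ("Llama2 7B Chat Uncensored", 6.7),
    ("Llama 2 70B HF", 69.0),
]

for name, params in models:
    # BF16 uses 2 bytes per parameter.
    print(f"{name}: ~{estimate_vram_gb(params):.1f} GB at BF16")
```

Under these assumptions the estimates land close to the quoted numbers. Passing a smaller `bytes_per_param` (for example 0.5 for a 4-bit quantization) gives a rough idea of how much quantizing would shrink the footprint.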

Question 2

What is the difference between Llama2 7B Chat Uncensored and Llama 2 70B HF?

Accepted Answer

Llama2 7B Chat Uncensored is a 6.7B-parameter model from georgesung, while Llama 2 70B HF is a 69.0B-parameter model from Meta; both belong to the Llama 2 family. Compare their VRAM requirements above to see which fits your GPU or Mac.

Spec          Llama2 7B Chat Uncensored   Llama 2 70B HF
Parameters    6.7B                        69.0B
Context       2K                          n/a
Architecture  LlamaForCausalLM            n/a
License       Other                       Llama 2 Community
Downloads     1.0K                        15.4K
Released      May 2024                    Apr 2024
