Question 1

Which needs less VRAM, Llama 2 Ko 7B or Llama2 7B Chat Uncensored?

Accepted Answer

At BF16, Llama 2 Ko 7B needs 15.1 GB of VRAM and Llama2 7B Chat Uncensored needs 14.8 GB. Llama2 7B Chat Uncensored is therefore the lighter option to run locally, though only by about 0.3 GB.
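
As a rough cross-check, BF16 stores each parameter in 2 bytes, so the weights alone take about (parameter count × 2) bytes. The sketch below assumes the parameter counts listed on this page (6.9B and 6.7B) and decimal gigabytes; the function name `weights_gb` and the int8/int4 byte sizes are illustrative assumptions, not this page's method. It covers weights only, so it comes out lower than the 15.1 GB and 14.8 GB quoted above, which presumably include some runtime overhead.

```python
# Nominal bytes per parameter. Real quantized formats add some overhead,
# so treat the int8/int4 values as lower bounds (assumption).
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weights_gb(params_billion: float, dtype: str = "bf16") -> float:
    """Weights-only memory estimate in decimal GB (hypothetical helper)."""
    total_bytes = params_billion * 1e9 * BYTES_PER_PARAM[dtype]
    return total_bytes / 1e9


# Parameter counts as listed on this page.
for name, params in [("Llama 2 Ko 7B", 6.9), ("Llama2 7B Chat Uncensored", 6.7)]:
    print(f"{name}: ~{weights_gb(params):.1f} GB of BF16 weights")
```

The gap between the two estimates follows directly from the difference in parameter count, which matches the direction of the figures in the answer above.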

Question 2

Which has a longer context window, Llama 2 Ko 7B or Llama2 7B Chat Uncensored?

Accepted Answer

Neither. Llama 2 Ko 7B and Llama2 7B Chat Uncensored both support a context window of 2,048 tokens.

Question 3

What is the difference between Llama 2 Ko 7B and Llama2 7B Chat Uncensored?

Accepted Answer

Llama 2 Ko 7B is a 6.9B-parameter model from beomi, while Llama2 7B Chat Uncensored is a 6.7B-parameter model from georgesung. Both belong to the Llama 2 family. Compare their VRAM requirements above to see which fits your GPU or Mac.

Specification	Llama 2 Ko 7B	Llama2 7B Chat Uncensored
Parameters	6.9B	6.7B
Context	2K	2K
Architecture	LlamaForCausalLM	LlamaForCausalLM
License	—	Other
Downloads	1.4K	1.0K
Released	Dec 2023	May 2024
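
To check which model fits a given GPU or Mac, one option is to compare the BF16 figures reported on this page (15.1 GB and 14.8 GB) against the memory you have available. This is a minimal sketch: the function name `models_that_fit` and the 1 GB default headroom are hypothetical choices for illustration, not values from this page.

```python
# BF16 VRAM figures as reported on this page.
REPORTED_BF16_GB = {
    "Llama 2 Ko 7B": 15.1,
    "Llama2 7B Chat Uncensored": 14.8,
}


def models_that_fit(available_gb: float, headroom_gb: float = 1.0) -> list[str]:
    """Return the models whose reported BF16 requirement fits the budget.

    headroom_gb is an assumed safety margin for other processes on the device.
    """
    budget = available_gb - headroom_gb
    return [name for name, need in REPORTED_BF16_GB.items() if need <= budget]


print(models_that_fit(16.0))  # 16 GB device, 1 GB headroom
print(models_that_fit(24.0))  # 24 GB device, 1 GB headroom
```

With these inputs, a 16 GB device leaves room only for Llama2 7B Chat Uncensored at BF16, while a 24 GB device fits either model.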

Llama 2 Ko 7B vs Llama2 7B Chat Uncensored
