Llama3 8B 1.58 100B Tokens vs Hermes 2 Pro Llama 3 8B

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

Llama3 8B 1.58 100B Tokens

HF1BitLLM · 2.8B

Chat
Hermes 2 Pro Llama 3 8B

Nous Research · 8.0B

Chat

Specifications

Llama3 8B 1.58 100B TokensHermes 2 Pro Llama 3 8B
Parameters2.8B8.0B
Context8K8K
ArchitectureLlamaForCausalLMLlamaForCausalLM
LicenseLlama 3 Community
Downloads1.6K15.7K
ReleasedSep 2024Sep 2024

VRAM by Quantization: Llama3 8B 1.58 100B Tokens vs Hermes 2 Pro Llama 3 8B

QuantizationBitsLlama3 8B 1.58 100B Tokens VRAMHermes 2 Pro Llama 3 8B VRAM
BF1616.006.2 GB
FP1616.0016.6 GB

Verdict

Hermes 2 Pro Llama 3 8B is the more widely downloaded of the two.

Frequently Asked Questions

Which has a longer context window, Llama3 8B 1.58 100B Tokens or Hermes 2 Pro Llama 3 8B?

Llama3 8B 1.58 100B Tokens supports 8,192 tokens and Hermes 2 Pro Llama 3 8B supports 8,192 tokens.

What is the difference between Llama3 8B 1.58 100B Tokens and Hermes 2 Pro Llama 3 8B?

Llama3 8B 1.58 100B Tokens is a 2.8B model from HF1BitLLM (Llama 3 family), while Hermes 2 Pro Llama 3 8B is a 8.0B model from Nous Research (Llama 3 family). Compare their VRAM requirements above to see which fits your GPU or Mac.