Llama 3.3 70B Instruct Abliterated vs Finance Llama3 8B

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

Finance Llama3 8B

instruction-pretrain · 8.0B

Chat

Specifications

Llama 3.3 70B Instruct AbliteratedFinance Llama3 8B
Parameters70.6B8.0B
Context131K8K
ArchitectureLlamaForCausalLMLlamaForCausalLM
Licensellama3.3Llama 3 Community
Downloads4.3K1.9K
ReleasedDec 2024Mar 2026

VRAM by Quantization: Llama 3.3 70B Instruct Abliterated vs Finance Llama3 8B

QuantizationBitsLlama 3.3 70B Instruct Abliterated VRAMFinance Llama3 8B VRAM
Q2_K3.4031.0 GB
Q3_K_M3.9035.4 GB
Q3_K_S3.5031.8 GB
Q4_04.0036.3 GB
Q4_K_M4.8043.3 GB
Q5_K_M5.7051.2 GB
Q6_K6.6059.2 GB
Q8_08.0071.5 GB

Verdict

Llama 3.3 70B Instruct Abliterated supports a longer context window (131K tokens). Llama 3.3 70B Instruct Abliterated is the more widely downloaded of the two.

Frequently Asked Questions

Which has a longer context window, Llama 3.3 70B Instruct Abliterated or Finance Llama3 8B?

Llama 3.3 70B Instruct Abliterated supports 131,072 tokens and Finance Llama3 8B supports 8,192 tokens.

What is the difference between Llama 3.3 70B Instruct Abliterated and Finance Llama3 8B?

Llama 3.3 70B Instruct Abliterated is a 70.6B model from huihui-ai (Llama 3 family), while Finance Llama3 8B is a 8.0B model from instruction-pretrain (Llama 3 family). Compare their VRAM requirements above to see which fits your GPU or Mac.