GPT OSS 20B Heretic vs GPT OSS 20B Heretic Ara v3
Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.
Specifications
| GPT OSS 20B Heretic | GPT OSS 20B Heretic Ara v3 | |
|---|---|---|
| Parameters | 20.9B | 1.8B |
| Context | 131K | 131K |
| Architecture | GptOssForCausalLM | GptOssForCausalLM |
| License | Apache 2.0 | Apache 2.0 |
| Downloads | 913 | 1.1K |
| Released | Nov 2025 | Mar 2026 |
VRAM by Quantization: GPT OSS 20B Heretic vs GPT OSS 20B Heretic Ara v3
| Quantization | Bits | GPT OSS 20B Heretic VRAM | GPT OSS 20B Heretic Ara v3 VRAM |
|---|---|---|---|
| IQ4_NL | 4.50 | 12.1 GB | 1.4 GB |
| Q5_1 | 5.50 | 14.8 GB | 1.6 GB |
| Q8_0 | 8.00 | 21.3 GB | 2.2 GB |
Verdict
GPT OSS 20B Heretic Ara v3 needs less VRAM at IQ4_NL (1.4 GB vs 12.1 GB), so it fits on smaller GPUs. GPT OSS 20B Heretic Ara v3 is the more widely downloaded of the two.
Frequently Asked Questions
- Which needs less VRAM, GPT OSS 20B Heretic or GPT OSS 20B Heretic Ara v3?
At IQ4_NL, GPT OSS 20B Heretic needs 12.1 GB and GPT OSS 20B Heretic Ara v3 needs 1.4 GB, so GPT OSS 20B Heretic Ara v3 is the lighter option to run locally.
- Which has a longer context window, GPT OSS 20B Heretic or GPT OSS 20B Heretic Ara v3?
GPT OSS 20B Heretic supports 131,072 tokens and GPT OSS 20B Heretic Ara v3 supports 131,072 tokens.
- What is the difference between GPT OSS 20B Heretic and GPT OSS 20B Heretic Ara v3?
GPT OSS 20B Heretic is a 20.9B model from p-e-w (GPT-OSS family), while GPT OSS 20B Heretic Ara v3 is a 1.8B model from p-e-w (GPT-OSS family). Compare their VRAM requirements above to see which fits your GPU or Mac.