GPT-OSS Models — Hardware Requirements

7 GPT-OSS models from OpenAI and the community, from the smallest that runs in 6.3 GB of VRAM up to 120.4B parameters. Every row links to full quantization tables and GPU compatibility.

All GPT-OSS Models by Size

ModelParamsContext
Huihui GPT OSS 20B BF16 Abliterated20.9B131K
GPT OSS 20B Heretic20.9B131K
GPT OSS 20B21.5B131K
GPT OSS Safeguard 20B21.5B131K
GPT OSS 20B Heretic Ara v321.5B131K
GPT OSS 20B RichardErkhov Heresy21.5B131K
GPT OSS 120B120.4B131K

How GPT-OSS Compares — Benchmark Rating

GPT OSS 120B is the highest-rated GPT-OSS model with an overall benchmark rating of 46.3/100 — #42 among 75 open models. The top proprietary model, GPT 5.5, scores 88.8. Click a model to see its full benchmark breakdown.

GPT 5.5 · proprietary88.8
Claude Opus 4.7 · proprietary87.6
Claude Fable 5 · proprietary86.6
GPT 5.4 · proprietary86.6
Claude Opus 4.8 · proprietary84.4
Composite of normalized public benchmark scores (methodology) · GPT-OSS · other models

Frequently Asked Questions

How much VRAM do I need to run a GPT-OSS model?
The smallest GPT-OSS model, GPT OSS 20B, runs from 6.3 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.
Which GPT-OSS models can I run on a 16 GB GPU?
6 of 7 GPT-OSS models fit in 16 GB of VRAM at some quantization, including GPT OSS 20B, Huihui GPT OSS 20B BF16 Abliterated, GPT OSS Safeguard 20B.
What is the most popular GPT-OSS model to run locally?
GPT OSS 120B is the most downloaded GPT-OSS model in local-friendly quantized formats. It runs from 51.6 GB of VRAM.
How do GPT-OSS models score on benchmarks?
GPT OSS 120B leads the family with an overall benchmark rating of 46.3/100, ranking #42 among 75 open models, while the top proprietary model, GPT 5.5, scores 88.8. See the comparison chart above for the full standings.