Apertus Models — Hardware Requirements

3 Apertus models from swiss-ai and the community, from the smallest that runs in 2.8 GB of VRAM up to 70B parameters. Every row links to full quantization tables and GPU compatibility.

All Apertus Models by Size

ModelParamsContext
Apertus 8B Instruct 25098B66K
Apertus 8B MeditronFO8.1B66K
Apertus 70B Instruct 250970B66K

Frequently Asked Questions

How much VRAM do I need to run a Apertus model?
The smallest Apertus model, Apertus 8B Instruct 2509, runs from 2.8 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.
Which Apertus models can I run on a 16 GB GPU?
2 of 3 Apertus models fit in 16 GB of VRAM at some quantization, including Apertus 8B Instruct 2509, Apertus 8B MeditronFO.
What is the most popular Apertus model to run locally?
Apertus 8B Instruct 2509 is the most downloaded Apertus model in local-friendly quantized formats. It runs from 2.8 GB of VRAM.