Question 1

How much VRAM do I need to run an Apertus model?

Accepted Answer

The smallest Apertus model, Apertus 8B Instruct 2509, runs from 2.8 GB of VRAM at an aggressive quantization. Larger family members need more: Apertus 70B Instruct 2509 starts at 30.7 GB. See the table below for every model.

Question 2

Which Apertus models can I run on a 16 GB GPU?

Accepted Answer

2 of 3 Apertus models fit in 16 GB of VRAM at some quantization: Apertus 8B Instruct 2509 (from 2.8 GB) and Apertus 8B MeditronFO (from 4.0 GB). Apertus 70B Instruct 2509 starts at 30.7 GB, so it does not fit.
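To check a different GPU size, the same comparison can be sketched in a few lines of Python. This is a minimal illustration, not an official tool: the function name `models_that_fit` is hypothetical, and the numbers are the "Runs from" values from the model table on this page. Those values are the floor at the most aggressive quantization, so higher-quality quantizations and long contexts will need more VRAM than this check assumes.

```python
# Minimum VRAM figures (GB), copied from the "Runs from" column
# of the model table on this page.
MIN_VRAM_GB = {
    "Apertus 8B Instruct 2509": 2.8,
    "Apertus 8B MeditronFO": 4.0,
    "Apertus 70B Instruct 2509": 30.7,
}


def models_that_fit(budget_gb, min_vram=MIN_VRAM_GB):
    """Return the models whose smallest quantization fits in budget_gb,
    ordered from lowest to highest VRAM requirement."""
    fitting = [(gb, name) for name, gb in min_vram.items() if gb <= budget_gb]
    return [name for gb, name in sorted(fitting)]


if __name__ == "__main__":
    for name in models_that_fit(16):
        print(name)
```

Calling `models_that_fit(16)` returns the two 8B models, which matches the answer above. Passing a budget of 32 would also include the 70B model, though only at its smallest quantization.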

Question 3

What is the most popular Apertus model to run locally?

Accepted Answer

Apertus 8B Instruct 2509 is the most downloaded Apertus model in local-friendly quantized formats. It runs from 2.8 GB of VRAM.

Model	Params	Runs from	Context	Publisher	Quant downloads
Apertus 8B Instruct 2509	8B	2.8 GB	66K	swiss-ai	2.4K
Apertus 8B MeditronFO	8.1B	4.0 GB	66K	EPFLiGHT	—
Apertus 70B Instruct 2509	70B	30.7 GB	66K	swiss-ai	—

Apertus Models — Hardware Requirements
