ERNIE Models — Hardware Requirements
4 ERNIE models from Baidu and the community, ranging from 361M to 21B parameters; the smallest runs in as little as 0.5 GB of VRAM. Every row links to full quantization tables and GPU compatibility.
All ERNIE Models by Size
| Model | Params | Runs from (VRAM) | Context | Publisher | Quant downloads |
|---|---|---|---|---|---|
| ERNIE 4.5 0.3B PT | 361M | 0.5 GB | 131K | | |
| ERNIE 4.5 0.3B Paddle | 361M | 1.0 GB | 131K | | |
| ErniePEUnleashed | 3.4B | 1.9 GB | 262K | | |
| ERNIE 4.5 21B A3B PT | 21B | 6.2 GB | 131K | | |
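
For a rough sense of how quantization drives the VRAM figures, weight memory can be approximated as parameter count times bits per weight, divided by 8. The sketch below is an estimate under stated assumptions, not the method behind this page's numbers: the `weight_gb` helper and the bit widths are illustrative, and the result ignores KV cache, activations, and runtime overhead. The table lists each model's most aggressive available quantization, which may use fewer bits than shown here, so the outputs need not match the "Runs from" column.

```python
def weight_gb(params: float, bits_per_weight: float) -> float:
    """Approximate weight-only memory in GB (1 GB = 1e9 bytes)."""
    return params * bits_per_weight / 8 / 1e9

# Parameter counts come from the table; the bit widths are illustrative assumptions.
for name, params in [("ERNIE 4.5 0.3B PT", 361e6), ("ERNIE 4.5 21B A3B PT", 21e9)]:
    for bits in (16, 8, 4):
        print(f"{name}: {bits}-bit ~ {weight_gb(params, bits):.2f} GB")
```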
Frequently Asked Questions
- How much VRAM do I need to run an ERNIE model?
- The smallest ERNIE model, ERNIE 4.5 0.3B PT, runs from 0.5 GB of VRAM at an aggressive quantization. Larger family members need more, up to 6.2 GB for ERNIE 4.5 21B A3B PT; see the table above for every model.
- Which ERNIE models can I run on a 16 GB GPU?
- All 4 ERNIE models fit in 16 GB of VRAM at some quantization, including ERNIE 4.5 21B A3B PT, ERNIE 4.5 0.3B PT, and ErniePEUnleashed.
- What is the most popular ERNIE model to run locally?
- ERNIE 4.5 21B A3B PT is the most downloaded ERNIE model in local-friendly quantized formats. It runs from 6.2 GB of VRAM.
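
The "which models fit my GPU" question above comes down to comparing each model's minimum VRAM figure with a budget. A minimal sketch using only the figures from the table; the `MODELS` list and the `fits` helper are hypothetical names chosen for illustration, and the check assumes the listed minimum is the only constraint.

```python
# (model name, minimum VRAM in GB), copied from the table on this page
MODELS = [
    ("ERNIE 4.5 0.3B PT", 0.5),
    ("ERNIE 4.5 0.3B Paddle", 1.0),
    ("ErniePEUnleashed", 1.9),
    ("ERNIE 4.5 21B A3B PT", 6.2),
]

def fits(budget_gb: float) -> list[str]:
    """Return the names of models whose minimum VRAM figure is within the budget."""
    return [name for name, min_gb in MODELS if min_gb <= budget_gb]

print(fits(16))  # every model in the table
print(fits(4))   # only the three smaller models
```

Longer contexts or less aggressive quantizations raise the real requirement, so a model that passes this check at its minimum figure may still not fit at the settings you want.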