ERNIE Models — Hardware Requirements

4 ERNIE models from Baidu and the community, from the smallest that runs in 0.5 GB of VRAM up to 21B parameters. Every row links to full quantization tables and GPU compatibility.

All ERNIE Models by Size

ModelParamsContext
ERNIE 4.5 0.3B PT361M131K
ERNIE 4.5 0.3B Paddle361M131K
ErniePEUnleashed3.4B262K
ERNIE 4.5 21B A3B PT21B131K

Frequently Asked Questions

How much VRAM do I need to run a ERNIE model?
The smallest ERNIE model, ERNIE 4.5 0.3B PT, runs from 0.5 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.
Which ERNIE models can I run on a 16 GB GPU?
4 of 4 ERNIE models fit in 16 GB of VRAM at some quantization, including ERNIE 4.5 21B A3B PT, ERNIE 4.5 0.3B PT, ErniePEUnleashed.
What is the most popular ERNIE model to run locally?
ERNIE 4.5 21B A3B PT is the most downloaded ERNIE model in local-friendly quantized formats. It runs from 6.2 GB of VRAM.