SmolLM Models — Hardware Requirements
13 SmolLM models from Hugging Face and the community, from the smallest that runs in 0.4 GB of VRAM up to 3.1B parameters. Every row links to full quantization tables and GPU compatibility.
All SmolLM Models by Size
| Model | Params | Runs from | Context | Publisher | Quant downloads |
|---|---|---|---|---|---|
| SmolLM2 70M | 69M | 0.4 GB | 8K | ||
| SmolLM2 135M Instruct | 135M | 0.4 GB | 8K | ||
| SmolLM 135M | 135M | 0.4 GB | 2K | ||
| SmolLM2 135M | 135M | 0.4 GB | 8K | ||
| SmolLM2 360M Instruct | 362M | 0.5 GB | 8K | ||
| SmolLM2 360M | 362M | 0.5 GB | 8K | ||
| SmolLM 360M Instruct | 362M | 0.5 GB | 2K | ||
| SmolLM2 1.7B Instruct | 1.7B | 1.4 GB | 8K | ||
| SmolLM2 1.7B | 1.7B | 1.4 GB | 8K | ||
| SmolLM 1.7B | 1.7B | 1.4 GB | 2K | ||
| SmolLM3 3B Base | 3B | 1.3 GB | 66K | ||
| SmolLM3 3B ONNX | 3B | 1.7 GB | 66K | ||
| SmolLM3 3B | 3.1B | 1.3 GB | 66K |
Frequently Asked Questions
- How much VRAM do I need to run a SmolLM model?
- The smallest SmolLM model, SmolLM2 70M, runs from 0.4 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.
- Which SmolLM models can I run on a 16 GB GPU?
- 13 of 13 SmolLM models fit in 16 GB of VRAM at some quantization, including SmolLM2 135M Instruct, SmolLM2 1.7B Instruct, SmolLM2 360M Instruct.
- What is the most popular SmolLM model to run locally?
- SmolLM2 135M Instruct is the most downloaded SmolLM model in local-friendly quantized formats. It runs from 0.4 GB of VRAM.