Phi 3 Models — Hardware Requirements

6 Phi 3 models from Microsoft and the community, from the smallest that runs in 2.3 GB of VRAM up to 41.9B parameters. Every row links to full quantization tables and GPU compatibility.

All Phi 3 Models by Size

ModelParamsContext
Phi 3.5 Mini Instruct3.8B131K
Phi 3 Mini 4k Instruct3.8B4K
Phi 3 Mini 128k Instruct3.8B131K
Phi 3 Small 8k Instruct7.4B8K
Phi 3 Medium 4k Instruct14.0B4K
Phi 3.5 MoE Instruct41.9B131K

Frequently Asked Questions

How much VRAM do I need to run a Phi 3 model?
The smallest Phi 3 model, Phi 3.5 Mini Instruct, runs from 2.3 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.
Which Phi 3 models can I run on a 16 GB GPU?
6 of 6 Phi 3 models fit in 16 GB of VRAM at some quantization, including Phi 3.5 Mini Instruct, Phi 3 Mini 4k Instruct, Phi 3.5 MoE Instruct.
What is the most popular Phi 3 model to run locally?
Phi 3.5 Mini Instruct is the most downloaded Phi 3 model in local-friendly quantized formats. It runs from 2.3 GB of VRAM.