Run LLMs on Your Phone — On-Device AI
Browse iPhones and Android phones for running quantized LLMs on-device. See which models fit in 8–16 GB of unified memory and run without an internet connection.
On-Device LLM Inference on Smartphones
Modern smartphones can run small quantized LLMs entirely on-device with no internet connection and no cloud costs. iPhones with A17 Pro or M-series chips and high-end Android devices with Snapdragon 8 Gen 3 are best suited for 1B–7B parameter models at aggressive quantization (Q4 or lower).
Phones List
Apple iPhone 17 Pro
Apple · A19 Pro · A19 Pro 6-core GPU · Phone
12 GB
76.8 GB/s6 GPU cores6 CPU cores
iPhone 15 Pro
Apple · A17 Pro · Phone
8 GB
6 GPU cores6 CPU cores
iPhone 15 Pro Max
Apple · A17 Pro · Phone
8 GB
6 GPU cores6 CPU cores
iPhone 16 Pro
Apple · A18 Pro · Phone
8 GB
6 GPU cores6 CPU cores
iPhone 16 Pro Max
Apple · A18 Pro · Phone
8 GB
6 GPU cores6 CPU cores
iPhone 17
Apple · A19 · A19 5-core GPU · Phone
8 GB
68.2 GB/s5 GPU cores6 CPU cores
iPhone 17 Pro Max
Apple · A19 Pro · Apple A19 Pro GPU (6-core) · Phone
12 GB
76.8 GB/s6 GPU cores6 CPU cores
iPhone Air
Apple · A19 Pro · Phone
12 GB
68.2 GB/s5 GPU cores6 CPU cores