All GPUs for Running LLMs Locally
Browse 94 GPUs compatible with running LLM models locally. Compare VRAM, memory bandwidth, and AI performance.
Which GPU Do You Need for AI?
The amount of VRAM is the most important specification for running LLMs locally. Most 7B parameter models require 4–8 GB of VRAM at common quantization levels, while 70B models need 24–48 GB. Memory bandwidth determines how fast the model generates tokens — faster bandwidth means faster responses.
GPU List
NVIDIA A100 40GB PCIe
NVIDIA · Ampere
NVIDIA A100 80GB SXM
NVIDIA · Ampere
NVIDIA A40
NVIDIA · Ampere
NVIDIA B200
NVIDIA · Blackwell
NVIDIA B300
NVIDIA · Blackwell Ultra
NVIDIA GH200 Grace Hopper Superchip
NVIDIA · Hopper (Grace Hopper)
NVIDIA GeForce GTX 1080 Ti
NVIDIA · Pascal
NVIDIA GeForce RTX 2080 Ti
NVIDIA · Turing
NVIDIA GeForce RTX 3050 8GB
NVIDIA · Ampere
NVIDIA GeForce RTX 3060 12GB
NVIDIA · Ampere
NVIDIA GeForce RTX 3060 8GB
NVIDIA · Ampere
NVIDIA GeForce RTX 3060 Ti
NVIDIA · Ampere
NVIDIA GeForce RTX 3070
NVIDIA · Ampere
NVIDIA GeForce RTX 3070 Ti
NVIDIA · Ampere
NVIDIA GeForce RTX 3080
NVIDIA · Ampere
NVIDIA GeForce RTX 3080 Ti
NVIDIA · Ampere
NVIDIA GeForce RTX 3090
NVIDIA · Ampere
NVIDIA GeForce RTX 3090 Ti
NVIDIA · Ampere
NVIDIA GeForce RTX 4060
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4060 Ti 16GB
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4060 Ti 8GB
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4070
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4070 SUPER
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4070 Ti
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4070 Ti SUPER
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4080
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4080 SUPER
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4090
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4090 Laptop GPU
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 5060
NVIDIA · Blackwell