All GPUs for Running LLMs Locally
Browse 94 GPUs compatible with running LLM models locally. Compare VRAM, memory bandwidth, and AI performance.
Which GPU Do You Need for AI?
The amount of VRAM is the most important specification for running LLMs locally. Most 7B parameter models require 4–8 GB of VRAM at common quantization levels, while 70B models need 24–48 GB. Memory bandwidth determines how fast the model generates tokens — faster bandwidth means faster responses.
GPU List
NVIDIA GeForce RTX 5060 Ti 16GB
NVIDIA · Blackwell
NVIDIA GeForce RTX 5060 Ti 8GB
NVIDIA · Blackwell
NVIDIA GeForce RTX 5070
NVIDIA · Blackwell
NVIDIA GeForce RTX 5070 Ti
NVIDIA · Blackwell
NVIDIA GeForce RTX 5080
NVIDIA · Blackwell
NVIDIA GeForce RTX 5090
NVIDIA · Blackwell
NVIDIA GeForce RTX 5090 Laptop GPU
NVIDIA · Blackwell
NVIDIA H100 PCIe
NVIDIA · Hopper
NVIDIA H100 SXM
NVIDIA · Hopper
NVIDIA H200 NVL
NVIDIA · Hopper
NVIDIA H200 SXM
NVIDIA · Hopper
NVIDIA L4
NVIDIA · Ada Lovelace
NVIDIA L40
NVIDIA · Ada Lovelace
NVIDIA L40S
NVIDIA · Ada Lovelace
NVIDIA Quadro RTX 8000
NVIDIA · Turing
NVIDIA RTX 4000 Ada Generation
NVIDIA · Ada Lovelace
NVIDIA RTX 5000 Ada Generation
NVIDIA · Ada Lovelace
NVIDIA RTX 6000 Ada Generation
NVIDIA · Ada Lovelace
NVIDIA RTX A4000
NVIDIA · Ampere
NVIDIA RTX A5000
NVIDIA · Ampere
NVIDIA RTX A6000
NVIDIA · Ampere
NVIDIA RTX PRO 4000 Blackwell
NVIDIA · Blackwell
NVIDIA RTX PRO 4500 Blackwell
NVIDIA · Blackwell
NVIDIA RTX PRO 5000 Blackwell
NVIDIA · Blackwell
NVIDIA RTX PRO 6000 Blackwell Max-Q Workstation Edition
NVIDIA · Blackwell
NVIDIA RTX PRO 6000 Blackwell Server Edition
NVIDIA · Blackwell
NVIDIA RTX PRO 6000 Blackwell Workstation Edition
NVIDIA · Blackwell
NVIDIA T4
NVIDIA · Turing
NVIDIA TITAN RTX
NVIDIA · Turing
NVIDIA Tesla M40 24GB
NVIDIA · Maxwell