GPUs with 16–undefined GB VRAM
Browse 70 GPUs with 16–undefined GB VRAM compatible with running LLM models locally. Compare VRAM, memory bandwidth, and AI performance.
← Show all GPUsWhich GPU Do You Need for AI?
The amount of VRAM is the most important specification for running LLMs locally. Most 7B parameter models require 4–8 GB of VRAM at common quantization levels, while 70B models need 24–48 GB. Memory bandwidth determines how fast the model generates tokens — faster bandwidth means faster responses.
GPU List
NVIDIA GeForce RTX 3090
NVIDIA · Ampere
NVIDIA GeForce RTX 3090 Ti
NVIDIA · Ampere
NVIDIA GeForce RTX 4060 Ti 16GB
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4070 Ti SUPER
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4080
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4080 SUPER
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4090
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 4090 Laptop GPU
NVIDIA · Ada Lovelace
NVIDIA GeForce RTX 5060 Ti 16GB
NVIDIA · Blackwell
NVIDIA GeForce RTX 5070 Ti
NVIDIA · Blackwell
NVIDIA GeForce RTX 5080
NVIDIA · Blackwell
NVIDIA GeForce RTX 5090
NVIDIA · Blackwell
NVIDIA GeForce RTX 5090 Laptop GPU
NVIDIA · Blackwell
NVIDIA H100 PCIe
NVIDIA · Hopper
NVIDIA H100 SXM
NVIDIA · Hopper
NVIDIA H200 NVL
NVIDIA · Hopper
NVIDIA H200 SXM
NVIDIA · Hopper
NVIDIA L4
NVIDIA · Ada Lovelace
NVIDIA L40
NVIDIA · Ada Lovelace
NVIDIA L40S
NVIDIA · Ada Lovelace
NVIDIA Quadro RTX 8000
NVIDIA · Turing
NVIDIA RTX 4000 Ada Generation
NVIDIA · Ada Lovelace
NVIDIA RTX 5000 Ada Generation
NVIDIA · Ada Lovelace
NVIDIA RTX 6000 Ada Generation
NVIDIA · Ada Lovelace
NVIDIA RTX A4000
NVIDIA · Ampere
NVIDIA RTX A5000
NVIDIA · Ampere
NVIDIA RTX A6000
NVIDIA · Ampere
NVIDIA RTX PRO 4000 Blackwell
NVIDIA · Blackwell
NVIDIA RTX PRO 4500 Blackwell
NVIDIA · Blackwell
NVIDIA RTX PRO 5000 Blackwell
NVIDIA · Blackwell