GPUs with 20–undefined GB VRAM
Browse 47 GPUs with 20–undefined GB VRAM compatible with running LLM models locally. Compare VRAM, memory bandwidth, and AI performance.
← Show all GPUsWhich GPU Do You Need for AI?
The amount of VRAM is the most important specification for running LLMs locally. Most 7B parameter models require 4–8 GB of VRAM at common quantization levels, while 70B models need 24–48 GB. Memory bandwidth determines how fast the model generates tokens — faster bandwidth means faster responses.
GPU List
NVIDIA L40S
NVIDIA · Ada Lovelace
864.0 GB/s18,176 CUDA350W TDP
NVIDIA Quadro RTX 8000
NVIDIA · Turing
672.0 GB/s4,608 CUDA260W TDP$9,999
NVIDIA RTX 4000 Ada Generation
NVIDIA · Ada Lovelace
360.0 GB/s6,144 CUDA130W TDP$1,250
NVIDIA RTX 5000 Ada Generation
NVIDIA · Ada Lovelace
576.0 GB/s12,800 CUDA250W TDP$4,000
NVIDIA RTX 6000 Ada Generation
NVIDIA · Ada Lovelace
960.0 GB/s18,176 CUDA300W TDP$6,799
NVIDIA RTX A5000
NVIDIA · Ampere
768.0 GB/s8,192 CUDA230W TDP$2,250
NVIDIA RTX A6000
NVIDIA · Ampere
768.0 GB/s10,752 CUDA300W TDP$4,649
NVIDIA RTX PRO 4000 Blackwell
NVIDIA · Blackwell
672.0 GB/s8,960 CUDA145W TDP$1,500
NVIDIA RTX PRO 4500 Blackwell
NVIDIA · Blackwell
896.0 GB/s10,496 CUDA200W TDP$2,600
NVIDIA RTX PRO 5000 Blackwell
NVIDIA · Blackwell
1344.0 GB/s14,080 CUDA300W TDP$4,500
NVIDIA RTX PRO 6000 Blackwell Max-Q Workstation Edition
NVIDIA · Blackwell
1792.0 GB/s24,064 CUDA300W TDP$8,565
NVIDIA RTX PRO 6000 Blackwell Server Edition
NVIDIA · Blackwell
1597.0 GB/s24,064 CUDA600W TDP
NVIDIA RTX PRO 6000 Blackwell Workstation Edition
NVIDIA · Blackwell
1792.0 GB/s24,064 CUDA600W TDP$8,565
NVIDIA TITAN RTX
NVIDIA · Turing
672.0 GB/s4,608 CUDA280W TDP$2,499
NVIDIA Tesla M40 24GB
NVIDIA · Maxwell
288.0 GB/s3,072 CUDA250W TDP
NVIDIA Tesla P40
NVIDIA · Pascal
346.0 GB/s3,840 CUDA3,840 SP250W TDP
NVIDIA V100 SXM2 32GB
NVIDIA · Volta
900.0 GB/s5,120 CUDA300W TDP