GPUs with 128–undefined GB VRAM
Browse 10 GPUs with 128–undefined GB VRAM compatible with running LLM models locally. Compare VRAM, memory bandwidth, and AI performance.
← Show all GPUsWhich GPU Do You Need for AI?
The amount of VRAM is the most important specification for running LLMs locally. Most 7B parameter models require 4–8 GB of VRAM at common quantization levels, while 70B models need 24–48 GB. Memory bandwidth determines how fast the model generates tokens — faster bandwidth means faster responses.
GPU List
AMD Instinct MI250X
AMD · CDNA 2
3276.8 GB/s14,080 SP560W TDP
AMD Instinct MI300X
AMD · CDNA 3
5300.0 GB/s19,456 SP750W TDP
AMD Instinct MI325X
AMD · CDNA 3
6000.0 GB/s19,456 SP1000W TDP
AMD Instinct MI350X
AMD · CDNA 4
8000.0 GB/s16,384 SP1000W TDP
AMD Instinct MI355X
AMD · CDNA 4
8000.0 GB/s16,384 SP1400W TDP
NVIDIA B200
NVIDIA · Blackwell
8000.0 GB/s1000W TDP
NVIDIA B300
NVIDIA · Blackwell Ultra
8000.0 GB/s20,480 CUDA1400W TDP
NVIDIA GH200 Grace Hopper Superchip
NVIDIA · Hopper (Grace Hopper)
4900.0 GB/s16,896 CUDA1000W TDP
NVIDIA H200 NVL
NVIDIA · Hopper
4800.0 GB/s16,896 CUDA600W TDP
NVIDIA H200 SXM
NVIDIA · Hopper
4800.0 GB/s16,896 CUDA700W TDP