GPUs with 128–undefined GB VRAM

Browse 10 GPUs with 128–undefined GB VRAM compatible with running LLM models locally. Compare VRAM, memory bandwidth, and AI performance.

← Show all GPUs

Which GPU Do You Need for AI?

The amount of VRAM is the most important specification for running LLMs locally. Most 7B parameter models require 4–8 GB of VRAM at common quantization levels, while 70B models need 24–48 GB. Memory bandwidth determines how fast the model generates tokens — faster bandwidth means faster responses.

AMD Instinct MI250X

AMD · CDNA 2

3276.8 GB/s14,080 SP560W TDP

AMD Instinct MI300X

AMD · CDNA 3

5300.0 GB/s19,456 SP750W TDP

AMD Instinct MI325X

AMD · CDNA 3

6000.0 GB/s19,456 SP1000W TDP

AMD Instinct MI350X

AMD · CDNA 4

8000.0 GB/s16,384 SP1000W TDP

AMD Instinct MI355X

AMD · CDNA 4

8000.0 GB/s16,384 SP1400W TDP

NVIDIA B200

NVIDIA · Blackwell

8000.0 GB/s1000W TDP

NVIDIA B300

NVIDIA · Blackwell Ultra

8000.0 GB/s20,480 CUDA1400W TDP

NVIDIA GH200 Grace Hopper Superchip

NVIDIA · Hopper (Grace Hopper)

4900.0 GB/s16,896 CUDA1000W TDP

NVIDIA H200 NVL

NVIDIA · Hopper

4800.0 GB/s16,896 CUDA600W TDP

NVIDIA H200 SXM

NVIDIA · Hopper

4800.0 GB/s16,896 CUDA700W TDP