GPUs with 40–undefined GB VRAM

Browse 27 GPUs with 40–undefined GB VRAM compatible with running LLM models locally. Compare VRAM, memory bandwidth, and AI performance.

← Show all GPUs

Which GPU Do You Need for AI?

The amount of VRAM is the most important specification for running LLMs locally. Most 7B parameter models require 4–8 GB of VRAM at common quantization levels, while 70B models need 24–48 GB. Memory bandwidth determines how fast the model generates tokens — faster bandwidth means faster responses.

GPU List

AMD Instinct MI210

AMD · CDNA 2

64 GB
1638.4 GB/s6,656 SP300W TDP

AMD Instinct MI250X

AMD · CDNA 2

128 GB
3276.8 GB/s14,080 SP560W TDP

AMD Instinct MI300X

AMD · CDNA 3

192 GB
5300.0 GB/s19,456 SP750W TDP

AMD Instinct MI325X

AMD · CDNA 3

256 GB
6000.0 GB/s19,456 SP1000W TDP

AMD Instinct MI350X

AMD · CDNA 4

288 GB
8000.0 GB/s16,384 SP1000W TDP

AMD Instinct MI355X

AMD · CDNA 4

288 GB
8000.0 GB/s16,384 SP1400W TDP

AMD Radeon PRO W7900

AMD · RDNA 3

48 GB
864.0 GB/s6,144 SP295W TDP$3,999

Intel Arc Pro B60 Dual 48GB

Intel · Xe2 (Battlemage), 2x BMG-G21 (Xe2-HPG)

48 GB
456.0 GB/s400W TDP$1,200

NVIDIA A100 40GB PCIe

NVIDIA · Ampere

40 GB
1555.0 GB/s6,912 CUDA250W TDP

NVIDIA A100 80GB SXM

NVIDIA · Ampere

80 GB
2039.0 GB/s6,912 CUDA400W TDP

NVIDIA A40

NVIDIA · Ampere

48 GB
696.0 GB/s10,752 CUDA300W TDP

NVIDIA B200

NVIDIA · Blackwell

192 GB
8000.0 GB/s1000W TDP

NVIDIA B300

NVIDIA · Blackwell Ultra

288 GB
8000.0 GB/s20,480 CUDA1400W TDP

NVIDIA GH200 Grace Hopper Superchip

NVIDIA · Hopper (Grace Hopper)

144 GB
4900.0 GB/s16,896 CUDA1000W TDP

NVIDIA H100 PCIe

NVIDIA · Hopper

80 GB
2039.0 GB/s14,592 CUDA350W TDP

NVIDIA H100 SXM

NVIDIA · Hopper

80 GB
3352.0 GB/s16,896 CUDA700W TDP

NVIDIA H200 NVL

NVIDIA · Hopper

141 GB
4800.0 GB/s16,896 CUDA600W TDP

NVIDIA H200 SXM

NVIDIA · Hopper

141 GB
4800.0 GB/s16,896 CUDA700W TDP

NVIDIA L40

NVIDIA · Ada Lovelace

48 GB
864.0 GB/s18,176 CUDA300W TDP

NVIDIA L40S

NVIDIA · Ada Lovelace

48 GB
864.0 GB/s18,176 CUDA350W TDP

NVIDIA Quadro RTX 8000

NVIDIA · Turing

48 GB
672.0 GB/s4,608 CUDA260W TDP$9,999

NVIDIA RTX 6000 Ada Generation

NVIDIA · Ada Lovelace

48 GB
960.0 GB/s18,176 CUDA300W TDP$6,799

NVIDIA RTX A6000

NVIDIA · Ampere

48 GB
768.0 GB/s10,752 CUDA300W TDP$4,649

NVIDIA RTX PRO 5000 Blackwell

NVIDIA · Blackwell

72 GB
1344.0 GB/s14,080 CUDA300W TDP$4,500

NVIDIA RTX PRO 6000 Blackwell Max-Q Workstation Edition

NVIDIA · Blackwell

96 GB
1792.0 GB/s24,064 CUDA300W TDP$8,565

NVIDIA RTX PRO 6000 Blackwell Server Edition

NVIDIA · Blackwell

96 GB
1597.0 GB/s24,064 CUDA600W TDP

NVIDIA RTX PRO 6000 Blackwell Workstation Edition

NVIDIA · Blackwell

96 GB
1792.0 GB/s24,064 CUDA600W TDP$8,565