All GPUs for Running LLMs Locally

Browse 56 GPUs compatible with running LLM models locally. Compare VRAM, memory bandwidth, and AI performance.

Which GPU Do You Need for AI?

The amount of VRAM is the most important specification for running LLMs locally. Most 7B parameter models require 4–8 GB of VRAM at common quantization levels, while 70B models need 24–48 GB. Memory bandwidth determines how fast the model generates tokens — faster bandwidth means faster responses.

GPU List

AMD Instinct MI210

AMD · CDNA 2

64 GB
1638.4 GB/s6,656 SP300W TDP

AMD Instinct MI250X

AMD · CDNA 2

128 GB
3276.8 GB/s14,080 SP560W TDP

AMD Instinct MI300X

AMD · CDNA 3

192 GB
5300.0 GB/s19,456 SP750W TDP

AMD Radeon PRO W7800

AMD · RDNA 3

32 GB
576.0 GB/s4,480 SP260W TDP$2,499

AMD Radeon PRO W7900

AMD · RDNA 3

48 GB
864.0 GB/s6,144 SP295W TDP$3,999

AMD Radeon RX 6700 XT

AMD · RDNA 2

12 GB
384.0 GB/s2,560 SP230W TDP$479

AMD Radeon RX 6800

AMD · RDNA 2

16 GB
512.0 GB/s3,840 SP250W TDP$579

AMD Radeon RX 6800 XT

AMD · RDNA 2

16 GB
512.0 GB/s4,608 SP300W TDP$649

AMD Radeon RX 6900 XT

AMD · RDNA 2

16 GB
512.0 GB/s5,120 SP300W TDP$999

AMD Radeon RX 7600

AMD · RDNA 3

8 GB
288.0 GB/s2,048 SP165W TDP$269

AMD Radeon RX 7700 XT

AMD · RDNA 3

12 GB
432.0 GB/s3,456 SP245W TDP$449

AMD Radeon RX 7800 XT

AMD · RDNA 3

16 GB
624.0 GB/s3,840 SP263W TDP$499

AMD Radeon RX 7900 XT

AMD · RDNA 3

20 GB
800.0 GB/s5,376 SP315W TDP$899

AMD Radeon RX 7900 XTX

AMD · RDNA 3

24 GB
960.0 GB/s6,144 SP355W TDP$999

Intel Arc A750

Intel · Alchemist

8 GB
512.0 GB/s225W TDP$289

Intel Arc A770 16GB

Intel · Alchemist

16 GB
560.0 GB/s225W TDP$349

Intel Arc B580

Intel · Battlemage

12 GB
456.0 GB/s190W TDP$249

NVIDIA A100 40GB PCIe

NVIDIA · Ampere

40 GB
1555.0 GB/s6,912 CUDA250W TDP

NVIDIA A100 80GB SXM

NVIDIA · Ampere

80 GB
2039.0 GB/s6,912 CUDA400W TDP

NVIDIA A40

NVIDIA · Ampere

48 GB
696.0 GB/s10,752 CUDA300W TDP

NVIDIA GeForce GTX 1080 Ti

NVIDIA · Pascal

11 GB
484.4 GB/s3,584 CUDA250W TDP$699

NVIDIA GeForce RTX 3060 12GB

NVIDIA · Ampere

12 GB
360.0 GB/s3,584 CUDA170W TDP$329

NVIDIA GeForce RTX 3060 Ti

NVIDIA · Ampere

8 GB
448.0 GB/s4,864 CUDA200W TDP$399

NVIDIA GeForce RTX 3070

NVIDIA · Ampere

8 GB
448.0 GB/s5,888 CUDA220W TDP$499

NVIDIA GeForce RTX 3070 Ti

NVIDIA · Ampere

8 GB
608.3 GB/s6,144 CUDA290W TDP$599

NVIDIA GeForce RTX 3080

NVIDIA · Ampere

10 GB
760.3 GB/s8,704 CUDA320W TDP$699

NVIDIA GeForce RTX 3080 Ti

NVIDIA · Ampere

12 GB
912.4 GB/s10,240 CUDA350W TDP$1,199

NVIDIA GeForce RTX 3090

NVIDIA · Ampere

24 GB
936.2 GB/s10,496 CUDA350W TDP$1,499

NVIDIA GeForce RTX 3090 Ti

NVIDIA · Ampere

24 GB
1008.0 GB/s10,752 CUDA450W TDP$1,999

NVIDIA GeForce RTX 4060

NVIDIA · Ada Lovelace

8 GB
272.0 GB/s3,072 CUDA115W TDP$299