GPUs with 6–10 GB VRAM
Browse 13 GPUs with 6–10 GB VRAM compatible with running LLM models locally. Compare VRAM, memory bandwidth, and AI performance.
← Show all GPUsWhich GPU Do You Need for AI?
The amount of VRAM is the most important specification for running LLMs locally. Most 7B parameter models require 4–8 GB of VRAM at common quantization levels, while 70B models need 24–48 GB. Memory bandwidth determines how fast the model generates tokens — faster bandwidth means faster responses.
GPU List
AMD Radeon RX 7600
AMD · RDNA 3
288.0 GB/s2,048 SP165W TDP$269
Intel Arc A750
Intel · Alchemist
512.0 GB/s225W TDP$289
Intel Arc B570
Intel · Xe2 (Battlemage), BMG-G21
380.0 GB/s2,304 SP150W TDP$219
NVIDIA GeForce RTX 3050 8GB
NVIDIA · Ampere
224.0 GB/s2,560 CUDA2,560 SP130W TDP$249
NVIDIA GeForce RTX 3060 8GB
NVIDIA · Ampere
240.0 GB/s3,584 CUDA170W TDP$329
NVIDIA GeForce RTX 3060 Ti
NVIDIA · Ampere
448.0 GB/s4,864 CUDA200W TDP$399
NVIDIA GeForce RTX 3070
NVIDIA · Ampere
448.0 GB/s5,888 CUDA220W TDP$499
NVIDIA GeForce RTX 3070 Ti
NVIDIA · Ampere
608.3 GB/s6,144 CUDA290W TDP$599
NVIDIA GeForce RTX 3080
NVIDIA · Ampere
760.3 GB/s8,704 CUDA320W TDP$699
NVIDIA GeForce RTX 4060
NVIDIA · Ada Lovelace
272.0 GB/s3,072 CUDA115W TDP$299
NVIDIA GeForce RTX 4060 Ti 8GB
NVIDIA · Ada Lovelace
288.0 GB/s4,352 CUDA160W TDP$399
NVIDIA GeForce RTX 5060
NVIDIA · Blackwell
448.0 GB/s3,840 CUDA145W TDP$299
NVIDIA GeForce RTX 5060 Ti 8GB
NVIDIA · Blackwell
448.0 GB/s4,608 CUDA180W TDP$379