Servers & DGX Systems for Local LLMs

Browse enterprise servers, NVIDIA DGX systems, and multi-GPU workstations providing 80–640 GB of VRAM for running the largest open-weight models.

High-VRAM Servers for Frontier-Scale Models

Servers and DGX systems offer the highest memory capacity available — NVLink bridges multiple GPUs into a shared VRAM pool of 160 GB or more. This enables running frontier-size 70B–405B parameter models at full or near-full precision without quantization compromises.

NVIDIA DGX A100 640GB

NVIDIA · 8x A100 SXM4 · Server

640 GB

16312.0 GB/s55296 GPU cores128 CPU cores

NVIDIA DGX H100

NVIDIA · 8x H100 SXM5 · Server

640 GB

26800.0 GB/s135168 GPU cores112 CPU cores

Servers & DGX Systems for Local LLMs

High-VRAM Servers for Frontier-Scale Models

Servers List

NVIDIA DGX A100 640GB

NVIDIA DGX H100