Llama 4 Maverick 17B 128E Instruct — Hardware Requirements & GPU Compatibility
VisionLlama 4 Maverick 17B 128E Instruct is a 401.6B-parameter open language model from Meta in the Llama family. At BF16 it needs about 883.48 GB of VRAM — see which GPUs and Macs can run it below.
Specifications
- Publisher
- Meta
- Family
- Llama
- Parameters
- 401.6B
- License
- Other
Get Started
How Much VRAM Does Llama 4 Maverick 17B 128E Instruct Need?
Select a quantization to see compatible GPUs below.
| Quantization | Bits | VRAM | + Context | File Size | Quality |
|---|---|---|---|---|---|
| BF16 | 16.00 | 883.5 GB | — | 803.17 GB | Brain floating point 16 — preferred for training |
Which GPUs Can Run Llama 4 Maverick 17B 128E Instruct?
BF16 · 883.5 GBLlama 4 Maverick 17B 128E Instruct (BF16) requires 883.5 GB of VRAM to load the model weights. For comfortable inference with headroom for KV cache and system overhead, 1149+ GB is recommended. No single GPU has enough memory — multi-GPU or cluster setups are needed.
Benchmarks
View all 8 →Related Models
Frequently Asked Questions
- How much VRAM does Llama 4 Maverick 17B 128E Instruct need?
Llama 4 Maverick 17B 128E Instruct requires 883.5 GB of VRAM at BF16.
VRAM = Weights + KV Cache + Overhead
Weights = 401.6B × 16 bits ÷ 8 = 803.2 GB
KV Cache + Overhead ≈ 80.3 GB (at 2K context + ~0.3 GB framework)
VRAM usage by quantization
BF16883.5 GB- Can NVIDIA GeForce RTX 5090 run Llama 4 Maverick 17B 128E Instruct?
No — Llama 4 Maverick 17B 128E Instruct requires at least 883.5 GB at BF16, which exceeds the NVIDIA GeForce RTX 5090's 32 GB of VRAM.
- Can I run Llama 4 Maverick 17B 128E Instruct on a Mac?
Llama 4 Maverick 17B 128E Instruct requires at least 883.5 GB at BF16, which exceeds the unified memory of most consumer Macs. You would need a Mac Studio or Mac Pro with a high-memory configuration.
- Can I run Llama 4 Maverick 17B 128E Instruct locally?
Yes — Llama 4 Maverick 17B 128E Instruct can run locally on consumer hardware. At BF16 quantization it needs 883.5 GB of VRAM. Popular tools include Ollama, LM Studio, and llama.cpp.
- What's the download size of Llama 4 Maverick 17B 128E Instruct?
At BF16, the download is about 803.17 GB.
- Which GPUs can run Llama 4 Maverick 17B 128E Instruct?
No single consumer GPU has enough VRAM to run Llama 4 Maverick 17B 128E Instruct at BF16 (883.5 GB). Multi-GPU or professional hardware is required.
- Which devices can run Llama 4 Maverick 17B 128E Instruct?
Llama 4 Maverick 17B 128E Instruct requires at least 883.5 GB at BF16, which exceeds the unified memory of most consumer devices. A high-memory Mac Studio, Mac Pro, or multi-GPU desktop setup is recommended.