Gemma 4 Models — Hardware Requirements
22 Gemma 4 models from Google and the community, from the smallest that runs in 2 GB of VRAM up to 32.7B parameters. Every row links to full quantization tables and GPU compatibility.
All Gemma 4 Models by Size
How Gemma 4 Compares — Benchmark Rating
Gemma 4 31B IT is the highest-rated Gemma 4 model, with an overall benchmark rating of 50.8/100, which places it #34 among 75 open models. The top proprietary model, GPT 5.5, scores 88.8.
GPT 5.5 · proprietary · 88.8
Claude Opus 4.7 · proprietary · 87.6
Claude Fable 5 · proprietary · 86.6
GPT 5.4 · proprietary · 86.6
Claude Opus 4.8 · proprietary · 84.4
DeepSeek V4 Pro · 77.5
Qwen3.6 27B · 74.0
StableBeluga2 · 69.1
MiniMax M2.7 · 68.4
Gemma 4 31B IT · 50.8
Frequently Asked Questions
- How much VRAM do I need to run a Gemma 4 model?
- The smallest Gemma 4 model, Gemma 4 E2B IT Qat Mobile Transformers, runs from 1.4 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.
- Which Gemma 4 models can I run on a 16 GB GPU?
- 34 of 37 Gemma 4 models fit in 16 GB of VRAM at some quantization, including Gemma 4 26B A4B IT, Gemma 4 31B IT, and Gemma 4 E4B IT.
- What is the most popular Gemma 4 model to run locally?
- Gemma 4 26B A4B IT is the most downloaded Gemma 4 model in local-friendly quantized formats. It runs from 8.0 GB of VRAM.
- How do Gemma 4 models score on benchmarks?
- Gemma 4 31B IT leads the family with an overall benchmark rating of 50.8/100, ranking #34 among 75 open models, while the top proprietary model, GPT 5.5, scores 88.8. See the comparison chart above for the full standings.
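
The VRAM answers above depend on parameter count and quantization bit width. As a rough illustration only, the sketch below estimates the memory taken by the weights as parameters times bits per weight divided by 8, then adds a flat overhead for the KV cache and runtime buffers. The function names and the 20% overhead are assumptions made for this example and are not the method behind the figures on this page. Real requirements also vary with context length, architecture, and inference runtime.

```python
def estimate_weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GB (1 GB = 1e9 bytes).

    params_billion * 1e9 weights, each taking bits_per_weight / 8 bytes,
    divided by 1e9 bytes per GB, simplifies to the expression below.
    """
    return params_billion * bits_per_weight / 8


def fits_in_vram(
    params_billion: float,
    bits_per_weight: float,
    vram_gb: float,
    overhead_fraction: float = 0.2,  # assumed allowance for KV cache and buffers
) -> bool:
    """Return True if the estimated weights plus overhead fit in vram_gb."""
    needed = estimate_weight_memory_gb(params_billion, bits_per_weight) * (1 + overhead_fraction)
    return needed <= vram_gb


if __name__ == "__main__":
    # 32.7B is the largest parameter count mentioned on this page.
    for bits in (16, 8, 4, 3):
        size = estimate_weight_memory_gb(32.7, bits)
        verdict = "fits" if fits_in_vram(32.7, bits, vram_gb=16) else "does not fit"
        print(f"{bits}-bit: ~{size:.2f} GB of weights, {verdict} in 16 GB")
```

Under these assumptions, a 32.7B-parameter model needs about 16.35 GB for weights at 4 bits per weight, so it would fit a 16 GB GPU only at a more aggressive quantization. That is consistent with the page saying models fit "at some quantization" and not at every one.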