Gemma 4 Models — Hardware Requirements

37 Gemma 4 models from Google and the community, from the smallest, which runs in 2 GB of VRAM, up to the largest at 32.7B parameters. Every row links to full quantization tables and GPU compatibility.

All Gemma 4 Models by Size

| Model | Params | Context |
|---|---|---|
| Gemma 4 E2B IT Qat Mobile Transformers | 2.3B | 131K |
| Gemma 4 E4B IT Assistant | 4B | 131K |
| Turkish Gemma 4B T1 Scout | 4.3B | 131K |
| Gemma 4 E2B IT Qat Q4 0 Unquantized | 5.1B | 131K |
| Gemma 4 E2B IT Qat Q4 0 Unquantized Heretic | 5.1B | 131K |
| Gemma 4 E2B IT | 5.1B | 131K |
| Gemma 4 E2B IT Uncensored | 5.1B | 131K |
| Supergemma4 E4b Abliterated | 7.5B | 131K |
| Gemma 4 E4B IT Qat Q4 0 Unquantized | 7.9B | 131K |
| Gemma 4 E4B IT OBLITERATED | 8.0B | 131K |
| Gemma4 E4B MiniFantasy V1 | 8.0B | 131K |
| Gemma 4 E4B IT | 8.0B | 131K |
| Gemma 4 E4B IT Ultra Uncensored Heretic | 8.0B | 131K |
| Gemma 4 E4B Luchador | 8.0B | 131K |
| Gemma 4 12B IT Heretic | 12.0B | 131K |
| Gemma 4 12B IT | 12.0B | 262K |
| Gemma 4 12B IT Qat Q4 0 Unquantized | 12.0B | 262K |
| Gemma 4 12B OBLITERATED | 12.0B | 131K |
| Gemma 4 12B IT AEON Abliterated K4 BF16 | 12.0B | 262K |
| Gemma 4 12B | 12.0B | 262K |
| Gemma 4 12B IT Abliterated Uncensored | 12.0B | 131K |
| Gemma 4 12B IT Assistant | 12B | 262K |
| Gemma4 12B Mtp Assistant | 12B | |
| Gemma 4 19B | 19.0B | 262K |
| Gemma 4 26B A4B IT Uncensored | 25.8B | 262K |
| Gemma 4 26B A4B IT Uncensored Heretic | 25.8B | 262K |
| Gemma 4 26B A4B IT Assistant | 26B | 262K |
| Gemma 4 26B A4B IT DFlash | 26B | 262K |
| Gemma 4 26B A4B IT | 26.5B | 262K |
| Gemma 4 26B A4B IT Qat Q4 0 Unquantized | 26.5B | 262K |
| Gemma 4 31B IT Qat Q4 0 Unquantized Assistant | 31B | 131K |
| Gemma 4 31B IT Speculator.eagle3 | 31B | |
| Gemma 4 31B IT DFlash | 31B | 262K |
| Gemma 4 31B IT Uncensored Heretic | 31.3B | 262K |
| Gemma 4 31B IT | 32.7B | 262K |
| Gemma 4 31B IT Qat Q4 0 Unquantized | 32.7B | 262K |
| Gemma 4 31B IT Uncensored | 32.7B | 262K |

How Gemma 4 Compares — Benchmark Rating

Gemma 4 31B IT is the highest-rated Gemma 4 model, with an overall benchmark rating of 50.8/100, which places it #34 among 75 open models. The top proprietary model, GPT 5.5, scores 88.8.

GPT 5.5 (proprietary): 88.8
Claude Opus 4.7 (proprietary): 87.6
Claude Fable 5 (proprietary): 86.6
GPT 5.4 (proprietary): 86.6
Claude Opus 4.8 (proprietary): 84.4

Ratings are a composite of normalized public benchmark scores.
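
This page does not spell out how the composite is computed. As a minimal sketch of one common approach, the Python below min-max normalizes each benchmark to a 0-100 scale and takes the unweighted mean. The function name `composite_rating`, the benchmark names, their ranges, and the scores are all hypothetical and are not taken from this page's methodology or data.

```python
from statistics import mean


def composite_rating(scores: dict[str, float],
                     ranges: dict[str, tuple[float, float]]) -> float:
    """Min-max normalize each benchmark score to 0-100, then average.

    This is an assumed scheme for illustration, not the site's method.
    """
    normalized = []
    for bench, raw in scores.items():
        lo, hi = ranges[bench]
        if hi <= lo:
            raise ValueError(f"invalid range for {bench}")
        # Clip to the stated range so outliers cannot leave the 0-100 scale.
        clipped = min(max(raw, lo), hi)
        normalized.append(100 * (clipped - lo) / (hi - lo))
    return mean(normalized)


# Hypothetical benchmarks, ranges, and scores, for illustration only.
example_ranges = {"bench_a": (0.0, 100.0), "bench_b": (0.0, 10.0)}
example_scores = {"bench_a": 60.0, "bench_b": 4.0}
print(composite_rating(example_scores, example_ranges))
```

A real composite may weight benchmarks differently or normalize against a reference model, so numbers from this sketch are not comparable to the ratings listed here.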

Frequently Asked Questions

How much VRAM do I need to run a Gemma 4 model?
The smallest Gemma 4 model, Gemma 4 E2B IT Qat Mobile Transformers, runs from 1.4 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.
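
As a rough cross-check on figures like these, weight memory can be approximated from the parameter count and the number of bits stored per weight. The Python sketch below is a back-of-envelope rule, not the method used to produce the numbers on this page. The function name `estimate_weight_gb` and the 20% overhead factor are illustrative assumptions, and real usage also depends on context length, KV cache size, and the runtime.

```python
def estimate_weight_gb(params_billion: float, bits_per_weight: float,
                       overhead: float = 1.2) -> float:
    """Approximate VRAM in GB (10^9 bytes) needed to hold the model weights.

    overhead is an assumed multiplier for runtime buffers; it is a guess,
    not a measured value.
    """
    if params_billion <= 0 or bits_per_weight <= 0:
        raise ValueError("params_billion and bits_per_weight must be positive")
    bytes_per_weight = bits_per_weight / 8
    # Billions of parameters times bytes per parameter gives gigabytes.
    return params_billion * bytes_per_weight * overhead


# Parameter counts come from the table; the bit widths are illustrative.
for name, params in [("Gemma 4 E2B IT", 5.1), ("Gemma 4 31B IT", 32.7)]:
    for bits in (16, 8, 4):
        gb = estimate_weight_gb(params, bits)
        print(f"{name} at {bits}-bit: ~{gb:.1f} GB")
```

Estimates from this rule will not match the published minimums exactly, since those depend on the specific quantization format and on how much of the model is kept on the GPU.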
Which Gemma 4 models can I run on a 16 GB GPU?
34 of 37 Gemma 4 models fit in 16 GB of VRAM at some quantization, including Gemma 4 26B A4B IT, Gemma 4 31B IT, and Gemma 4 E4B IT.
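
One way to reason about a claim like this is to work backwards from a VRAM budget to the largest average bits per weight a model could use and still fit. The Python sketch below assumes the weights dominate memory and sets aside a hypothetical 1 GB for everything else. The name `max_bits_per_weight` and the reserve value are illustrative and are not derived from this page's data.

```python
def max_bits_per_weight(params_billion: float, vram_gb: float,
                        reserve_gb: float = 1.0) -> float:
    """Largest average bits per weight that fits in vram_gb (10^9 bytes).

    reserve_gb is an assumed allowance for KV cache and runtime buffers.
    """
    usable_gb = vram_gb - reserve_gb
    if params_billion <= 0 or usable_gb <= 0:
        raise ValueError("need a positive parameter count and usable VRAM")
    # Gigabytes times 8 gives gigabits; divide by billions of parameters.
    return usable_gb * 8 / params_billion


# Parameter counts come from the table above.
models = {
    "Gemma 4 E4B IT": 8.0,
    "Gemma 4 26B A4B IT": 26.5,
    "Gemma 4 31B IT": 32.7,
}
for name, params in models.items():
    bits = max_bits_per_weight(params, vram_gb=16)
    print(f"{name}: up to ~{bits:.1f} bits per weight in 16 GB")
```

Under these assumptions, a result below roughly 4 bits suggests the model fits only at an aggressive quantization, which is consistent with the "at some quantization" wording.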
What is the most popular Gemma 4 model to run locally?
Gemma 4 26B A4B IT is the most downloaded Gemma 4 model in local-friendly quantized formats. It runs from 8.0 GB of VRAM.
How do Gemma 4 models score on benchmarks?
Gemma 4 31B IT leads the family with an overall benchmark rating of 50.8/100, ranking #34 among 75 open models, while the top proprietary model, GPT 5.5, scores 88.8. See the comparison chart above for the full standings.