Question 1

How much VRAM do I need to run a Gemma 2 model?

Accepted Answer

The lightest Gemma 2 model, Gemma 2 2B IT, runs from 0.9 GB of VRAM at an aggressive quantization. Larger family members need more, roughly in proportion to parameter count: Gemma 2 9B IT runs from 3.0 GB and Gemma 2 27B IT from 9.0 GB. See the table below for every model.
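As a rough cross-check on these figures, the memory taken by the weights alone can be approximated as parameter count times bits per weight, divided by 8. The sketch below is an approximation under that assumption only: it ignores the KV cache, activations, and runtime overhead, and the function names are illustrative rather than taken from any tool.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (10^9 bytes).

    Assumption: every parameter is stored at the same bit width and
    nothing else (KV cache, activations, overhead) is counted.
    """
    return params_billions * bits_per_weight / 8


def implied_bits_per_weight(params_billions: float, size_gb: float) -> float:
    """Invert the estimate: bits per weight implied by a listed size."""
    return size_gb * 8 / params_billions


if __name__ == "__main__":
    # Gemma 2 2B IT: 2.6B parameters, listed as running from 0.9 GB.
    print(round(implied_bits_per_weight(2.6, 0.9), 2))
    # The same model at 16 bits per weight, weights only.
    print(round(weight_memory_gb(2.6, 16), 1))
```

By this estimate the 0.9 GB figure corresponds to under 3 bits per weight, which fits the "aggressive quantization" wording. Real usage will be higher once context length and runtime overhead are included.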

Question 2

Which Gemma 2 models can I run on a 16 GB GPU?

Accepted Answer

All 16 Gemma 2 models listed fit in 16 GB of VRAM at some quantization, including Gemma 2 2B IT, Gemma 2 2B IT Abliterated, and Gemma 2 9B IT. Even the largest, Gemma 2 27B IT, runs from 9.0 GB.
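The same check can be repeated for any GPU size by comparing each model's "Runs from" figure against the available VRAM. The sketch below hard-codes a handful of rows from the table on this page. The `headroom_gb` parameter is an assumption added here to leave room for context and runtime overhead; it is not part of the listed figures.

```python
# (model name, minimum VRAM in GB) copied from the "Runs from" column.
MODELS = [
    ("Gemma 2 2B IT", 0.9),
    ("Gemma 2 2B IT Abliterated", 1.6),
    ("Gemma 2 2B Jpn IT", 5.8),
    ("Gemma 2 9B IT", 3.0),
    ("Gemma 2 9B", 4.2),
    ("Gemma 2 27B IT", 9.0),
]


def models_that_fit(models, vram_gb, headroom_gb=0.0):
    """Return names of models whose minimum VRAM fits the budget.

    headroom_gb is a hypothetical safety margin subtracted from the
    GPU's VRAM before comparing.
    """
    budget = vram_gb - headroom_gb
    return [name for name, min_gb in models if min_gb <= budget]


if __name__ == "__main__":
    print(models_that_fit(MODELS, vram_gb=16))
    print(models_that_fit(MODELS, vram_gb=8, headroom_gb=2))
```

With 16 GB every row in this subset passes. With 8 GB and 2 GB held back, Gemma 2 27B IT drops out because its 9.0 GB minimum exceeds the 6 GB budget.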

Question 3

What is the most popular Gemma 2 model to run locally?

Accepted Answer

Gemma 2 2B IT is the most downloaded Gemma 2 model in local-friendly quantized formats. It runs from 0.9 GB of VRAM.

Question 4

How do Gemma 2 models score on benchmarks?

Accepted Answer

Gemma 2 27B IT leads the family with an overall benchmark rating of 29.9/100, ranking #60 among 73 open models, while the top proprietary model, Claude Fable 5 Max, scores 89.9. See the comparison chart above for the full standings.

Model	Params	Runs from (min VRAM)	Context	Publisher	Quant downloads
Gemma 2B IT	2.5B	1.2 GB	—	Google	1.3K
Gemma 2B	2.5B	1.2 GB	—	Google	1.3K
Codegemma 2B	2.5B	1.2 GB	—	Google	—
Nidum Gemma 2B Uncensored	2.5B	1.4 GB	8K	VibeStudio	—
Gemma 2 2B IT	2.6B	0.9 GB	8K	Google	466.7K
Gemma 2 2B IT Abliterated	2.6B	1.6 GB	8K	IlyaGusev	34.2K
Gemma 2 2B	2.6B	1.2 GB	—	Google	—
Gemma 2 2B Jpn IT	2.6B	5.8 GB	—	Google	—
Txgemma 2B Predict	2.6B	1.2 GB	—	Google	—
Shieldgemma 2B	2.6B	1.2 GB	—	Google	—
T5gemma 2B 2B Ul2	5.6B	2.6 GB	—	Google	—
Text2cypher Gemma 2 9B IT Finetuned 2024v1	9B	4.2 GB	—	neo4j	—
Gemma 2 9B IT	9.2B	3.0 GB	8K	Google	28.4K
Gemma 2 9B	9.2B	4.2 GB	—	Google	2.4K
Gemma 2 Mitra E	9.2B	4.8 GB	8K	buddhist-nlp	—
Gemma 2 27B IT	27.2B	9.0 GB	8K	Google	8.9K
