Question 1

How much VRAM do I need to run a Gemma model?

Accepted Answer

The smallest Gemma model, Functiongemma 270M IT, runs from 0.1 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.

Question 2

Which Gemma models can I run on a 16 GB GPU?

Accepted Answer

11 of 11 Gemma models fit in 16 GB of VRAM at some quantization, including Functiongemma 270M IT, Medgemma 27B Text IT, Gemma 7B.

Question 3

What is the most popular Gemma model to run locally?

Accepted Answer

Functiongemma 270M IT is the most downloaded Gemma model in local-friendly quantized formats. It runs from 0.1 GB of VRAM.

Question 4

How do Gemma models score on benchmarks?

Accepted Answer

Gemma 7B leads the family with an overall benchmark rating of 51.3/100, ranking #31 among 73 open models, while the top proprietary model, Claude Fable 5 Max, scores 89.9. See the comparison chart above for the full standings.

Model	Params	Runs from	Context	Publisher	Quant downloads
Functiongemma 270M IT	268M	0.1 GB	—	Google	10.2K
Functiongemma 270M Ft Mobile Actions	270M	0.6 GB	—	litert-community	—
T5gemma B B Ul2 IT	591M	1.3 GB	—	Google	—
Vaultgemma 1B	1.0B	2.3 GB	—	Google	—
T5gemma L L Ul2 IT	1.2B	2.7 GB	—	Google	—
Gemma 7B	8.5B	4.0 GB	—	Google	1.6K
Gemma 7B IT	8.5B	4.0 GB	—	Google	437
Codegemma 7B IT	8.5B	4.0 GB	—	Google	—
Turkish Gemma 9B T1	9.2B	4.8 GB	8K	ytu-ce-cosmos	—
Turkish Gemma 9B v0.1	9.2B	4.8 GB	8K	ytu-ce-cosmos	—
Medgemma 27B Text IT	27.0B	8.2 GB	—	Google	3.6K

Gemma Models — Hardware Requirements

All Gemma Models by Size

How Gemma Compares — Benchmark Rating

Frequently Asked Questions