GLM 5 Models — Hardware Requirements

3 GLM 5 models from zai-org and the community, from the smallest that runs in 211.5 GB of VRAM up to 753.9B parameters. Every row links to full quantization tables and GPU compatibility.

All GLM 5 Models by Size

ModelParamsContext
GLM 5.1753.9B203K
GLM 5753.9B203K
GLM 5 Abliterated753.9B203K

How GLM 5 Compares — Benchmark Rating

GLM 5.1 is the highest-rated GLM 5 model with an overall benchmark rating of 63.2/100 — #12 among 75 open models. The top proprietary model, GPT 5.5, scores 88.8. Click a model to see its full benchmark breakdown.

GPT 5.5 · proprietary88.8
Claude Opus 4.7 · proprietary87.6
Claude Fable 5 · proprietary86.6
GPT 5.4 · proprietary86.6
Claude Opus 4.8 · proprietary84.4
GLM 562.9
Composite of normalized public benchmark scores (methodology) · GLM 5 · other models

Frequently Asked Questions

How much VRAM do I need to run a GLM 5 model?
The smallest GLM 5 model, GLM 5.1, runs from 211.5 GB of VRAM at an aggressive quantization. Larger family members need proportionally more — see the table above for every model.
Which GLM 5 models can I run on a 16 GB GPU?
No GLM 5 model currently fits in 16 GB of VRAM — the family starts at 211.5 GB.
What is the most popular GLM 5 model to run locally?
GLM 5.1 is the most downloaded GLM 5 model in local-friendly quantized formats. It runs from 211.5 GB of VRAM.
How do GLM 5 models score on benchmarks?
GLM 5.1 leads the family with an overall benchmark rating of 63.2/100, ranking #12 among 75 open models, while the top proprietary model, GPT 5.5, scores 88.8. See the comparison chart above for the full standings.