GLM 4.5 Air Derestricted vs GLM 4.6 Derestricted v3

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

GLM 4.5 Air Derestricted

ArliAI · 110.5B

Chat
GLM 4.6 Derestricted v3

ArliAI · 356.8B

Chat

Specifications

GLM 4.5 Air DerestrictedGLM 4.6 Derestricted v3
Parameters110.5B356.8B
Context131K203K
ArchitectureGlm4MoeForCausalLMGlm4MoeForCausalLM
LicenseMITMIT
Downloads1721.3K
ReleasedDec 2025Dec 2025

VRAM by Quantization: GLM 4.5 Air Derestricted vs GLM 4.6 Derestricted v3

QuantizationBitsGLM 4.5 Air Derestricted VRAMGLM 4.6 Derestricted v3 VRAM
Q2_K3.4047.4 GB
Q3_K_M3.9054.3 GB
Q3_K_S3.5048.8 GB
Q4_04.0055.7 GB
Q4_K_M4.8066.7 GB
Q5_K_M5.7079.1 GB
Q6_K6.6091.6 GB
Q8_08.00110.9 GB

Verdict

GLM 4.6 Derestricted v3 supports a longer context window (203K tokens). GLM 4.6 Derestricted v3 is the more widely downloaded of the two.

Frequently Asked Questions

Which has a longer context window, GLM 4.5 Air Derestricted or GLM 4.6 Derestricted v3?

GLM 4.5 Air Derestricted supports 131,072 tokens and GLM 4.6 Derestricted v3 supports 202,752 tokens.

What is the difference between GLM 4.5 Air Derestricted and GLM 4.6 Derestricted v3?

GLM 4.5 Air Derestricted is a 110.5B model from ArliAI (GLM family), while GLM 4.6 Derestricted v3 is a 356.8B model from ArliAI (GLM family). Compare their VRAM requirements above to see which fits your GPU or Mac.