GLM 4.6 Derestricted v3 vs GLM 4.7 Flash Ultimate Irrefusable Heretic

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

Specifications

Specification   GLM 4.6 Derestricted v3   GLM 4.7 Flash Ultimate Irrefusable Heretic
Parameters      356.8B                    29.9B
Context         203K                      203K
Architecture    Glm4MoeForCausalLM        Glm4MoeLiteForCausalLM
License         MIT                       MIT
Downloads       1.3K                      125
Released        Dec 2025                  Mar 2026

VRAM by Quantization: GLM 4.6 Derestricted v3 vs GLM 4.7 Flash Ultimate Irrefusable Heretic

Quantization   Bits   GLM 4.6 Derestricted v3 VRAM   GLM 4.7 Flash Ultimate Irrefusable Heretic VRAM
Q2_K           3.40   not listed                     13.8 GB
Q3_K_M         3.90   not listed                     15.7 GB
Q3_K_S         3.50   not listed                     14.2 GB
Q4_0           4.00   not listed                     16.1 GB
Q4_K_M         4.80   not listed                     19.1 GB
Q5_K_M         5.70   not listed                     22.4 GB
Q6_K           6.60   not listed                     25.8 GB
Q8_0           8.00   not listed                     31.0 GB
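
The page does not say how these VRAM figures are calculated. As a rough cross-check, the sketch below estimates weights-only size as parameters times bits per weight divided by 8, using the parameter counts and bit widths listed above. The formula and the `weight_gb` helper are assumptions for illustration, not the site's method. The estimate leaves out KV cache and runtime overhead, so actual VRAM use will be higher.

```python
# Weights-only size estimate: parameters * bits per weight / 8 bytes.
# Assumption: this is NOT the page's stated formula; it ignores KV cache,
# activations, and runtime overhead, so actual VRAM use will be higher.

# Bit widths as listed in the table above.
BITS_PER_WEIGHT = {
    "Q2_K": 3.4,
    "Q3_K_S": 3.5,
    "Q3_K_M": 3.9,
    "Q4_0": 4.0,
    "Q4_K_M": 4.8,
    "Q5_K_M": 5.7,
    "Q6_K": 6.6,
    "Q8_0": 8.0,
}

# Parameter counts (in billions) from the Specifications section.
PARAMS_BILLION = {
    "GLM 4.6 Derestricted v3": 356.8,
    "GLM 4.7 Flash Ultimate Irrefusable Heretic": 29.9,
}


def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Return the weights-only size in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


if __name__ == "__main__":
    for model, params in PARAMS_BILLION.items():
        print(model)
        for quant, bits in BITS_PER_WEIGHT.items():
            print(f"  {quant:<8}{weight_gb(params, bits):>8.1f} GB (weights only)")
```

For the 29.9B model, these weights-only estimates come out roughly 1 GB below the table's figures, which suggests the listed values include some overhead allowance. That reading is an inference from the numbers, not something the page states.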

Verdict

GLM 4.6 Derestricted v3 is the more widely downloaded of the two, with 1.3K downloads against 125 for GLM 4.7 Flash Ultimate Irrefusable Heretic.

Frequently Asked Questions

Which has a longer context window, GLM 4.6 Derestricted v3 or GLM 4.7 Flash Ultimate Irrefusable Heretic?

Neither. GLM 4.6 Derestricted v3 and GLM 4.7 Flash Ultimate Irrefusable Heretic both support 202,752 tokens, shown as 203K in the specifications.

What is the difference between GLM 4.6 Derestricted v3 and GLM 4.7 Flash Ultimate Irrefusable Heretic?

GLM 4.6 Derestricted v3 is a 356.8B model from ArliAI (GLM family), while GLM 4.7 Flash Ultimate Irrefusable Heretic is a 29.9B model from llmfan46 (GLM family). Compare their VRAM requirements above to see which fits your GPU or Mac.
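
One way to make that comparison concrete is to pick the largest quantization whose listed VRAM fits a given memory budget. The sketch below is a minimal illustration. It assumes the single VRAM figure in each row of the table belongs to the 29.9B GLM 4.7 Flash Ultimate Irrefusable Heretic, since the figures match that model's size. The `largest_fitting_quant` helper and the example budgets are hypothetical.

```python
from typing import Optional

# VRAM figures copied from the table above.
# Assumption: they apply to GLM 4.7 Flash Ultimate Irrefusable Heretic (29.9B).
GLM_47_FLASH_VRAM_GB = {
    "Q2_K": 13.8,
    "Q3_K_S": 14.2,
    "Q3_K_M": 15.7,
    "Q4_0": 16.1,
    "Q4_K_M": 19.1,
    "Q5_K_M": 22.4,
    "Q6_K": 25.8,
    "Q8_0": 31.0,
}


def largest_fitting_quant(vram_table: dict, budget_gb: float) -> Optional[str]:
    """Return the quantization with the highest listed VRAM that still fits
    within budget_gb, or None if nothing fits."""
    fitting = {quant: gb for quant, gb in vram_table.items() if gb <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


if __name__ == "__main__":
    # Hypothetical memory budgets in GB.
    for budget in (12, 16, 24, 32):
        choice = largest_fitting_quant(GLM_47_FLASH_VRAM_GB, budget)
        print(f"{budget} GB budget: {choice or 'no listed quantization fits'}")
```

Usable memory is normally lower than a card's nominal capacity, so it is safer to pass a budget a little below the advertised figure.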