GLM 4.7 Flash Heretic 1.2.0 vs GLM 4.7 Flash Ultimate Irrefusable Heretic

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

Specifications

Field         GLM 4.7 Flash Heretic 1.2.0  GLM 4.7 Flash Ultimate Irrefusable Heretic
Parameters    29.9B                        29.9B
Context       203K                         203K
Architecture  Glm4MoeLiteForCausalLM       Glm4MoeLiteForCausalLM
License       MIT                          MIT
Downloads     333                          125
Released      Feb 2026                     Mar 2026

VRAM by Quantization: GLM 4.7 Flash Heretic 1.2.0 vs GLM 4.7 Flash Ultimate Irrefusable Heretic

Quantization  Bits  GLM 4.7 Flash Heretic 1.2.0 VRAM  GLM 4.7 Flash Ultimate Irrefusable Heretic VRAM
Q2_K          3.40  13.8 GB                           13.8 GB
Q3_K_M        3.90  15.7 GB                           15.7 GB
Q3_K_S        3.50  14.2 GB                           14.2 GB
Q4_0          4.00  16.1 GB                           16.1 GB
Q4_K_M        4.80  19.1 GB                           19.1 GB
Q5_K_M        5.70  22.4 GB                           22.4 GB
Q6_K          6.60  25.8 GB                           25.8 GB
Q8_0          8.00  31.0 GB                           31.0 GB
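
The page does not say how these VRAM figures were derived. As a rough cross-check, the sketch below computes a weight-only estimate (parameters times bits per weight, divided by 8 bits per byte) and compares it with the listed values. The function name weight_only_gb and the reading of "GB" as 10^9 bytes are assumptions made for illustration. Under those assumptions, each listed figure comes out roughly 1.1 GB above the weight-only estimate, which would be consistent with a fixed allowance for runtime overhead, though that interpretation is a guess.

```python
# Weight-only estimate: parameters x bits-per-weight / 8 bits-per-byte.
# Assumption: "GB" means 10^9 bytes; the source does not state its formula.
PARAMS_BILLION = 29.9  # from the Specifications table (same for both models)

# (quantization, bits per weight, VRAM in GB as listed in the table)
TABLE = [
    ("Q2_K", 3.40, 13.8),
    ("Q3_K_M", 3.90, 15.7),
    ("Q3_K_S", 3.50, 14.2),
    ("Q4_0", 4.00, 16.1),
    ("Q4_K_M", 4.80, 19.1),
    ("Q5_K_M", 5.70, 22.4),
    ("Q6_K", 6.60, 25.8),
    ("Q8_0", 8.00, 31.0),
]


def weight_only_gb(params_billion: float, bits: float) -> float:
    """Approximate size of the quantized weights alone, in GB."""
    return params_billion * bits / 8


for name, bits, listed in TABLE:
    estimate = weight_only_gb(PARAMS_BILLION, bits)
    gap = listed - estimate
    print(f"{name:8s} weights ~{estimate:5.1f} GB  listed {listed:5.1f} GB  gap {gap:4.1f} GB")
```

Because both models have the same parameter count, the estimate is the same for either one at any given quantization.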

Verdict

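The two models match on parameter count, context length, architecture, license, and estimated VRAM at every listed quantization. The only differences shown are release date and popularity: GLM 4.7 Flash Heretic 1.2.0 is the earlier release and the more widely downloaded of the two.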

Frequently Asked Questions

Which needs less VRAM, GLM 4.7 Flash Heretic 1.2.0 or GLM 4.7 Flash Ultimate Irrefusable Heretic?

Neither. At Q4_K_M, both GLM 4.7 Flash Heretic 1.2.0 and GLM 4.7 Flash Ultimate Irrefusable Heretic need 19.1 GB, and their estimates are identical at every other listed quantization, so they are equally demanding to run locally.

Which has a longer context window, GLM 4.7 Flash Heretic 1.2.0 or GLM 4.7 Flash Ultimate Irrefusable Heretic?

Neither. GLM 4.7 Flash Heretic 1.2.0 and GLM 4.7 Flash Ultimate Irrefusable Heretic both support 202,752 tokens (about 203K).

What is the difference between GLM 4.7 Flash Heretic 1.2.0 and GLM 4.7 Flash Ultimate Irrefusable Heretic?

Both are 29.9B models in the GLM family. GLM 4.7 Flash Heretic 1.2.0 is from darkc0de, while GLM 4.7 Flash Ultimate Irrefusable Heretic is from llmfan46. Their VRAM requirements are identical, so the table above tells you which quantization fits your GPU or Mac, not which of the two models does.
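
To check which quantization fits a given GPU or Mac, one option is a small lookup over the listed VRAM figures. The sketch below is illustrative only: the helper name largest_fitting_quant is hypothetical, and it treats the listed figure as the full memory requirement, which may understate real usage at long context lengths.

```python
from typing import Optional

# VRAM in GB per quantization, copied from the table above.
# The figures are the same for both models.
VRAM_GB = {
    "Q2_K": 13.8,
    "Q3_K_M": 15.7,
    "Q3_K_S": 14.2,
    "Q4_0": 16.1,
    "Q4_K_M": 19.1,
    "Q5_K_M": 22.4,
    "Q6_K": 25.8,
    "Q8_0": 31.0,
}


def largest_fitting_quant(budget_gb: float) -> Optional[str]:
    """Return the quantization with the highest listed VRAM that still fits
    within budget_gb, or None if even the smallest one does not fit."""
    fitting = {name: gb for name, gb in VRAM_GB.items() if gb <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


print(largest_fitting_quant(24))  # Q5_K_M (22.4 GB fits, 25.8 GB does not)
print(largest_fitting_quant(16))  # Q3_K_M (15.7 GB fits, 16.1 GB does not)
print(largest_fitting_quant(12))  # None
```

In practice it is prudent to leave some headroom below the device's total memory rather than match the listed figure exactly.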