Qwen3 42B A3B 2507 Thinking Abliterated Uncensored TOTAL RECALL v2 Medium MASTER CODER vs Qwen1.5 72B Chat

Side-by-side comparison of VRAM requirements, quantization, context length, and hardware compatibility.

Specifications

| Specification | Qwen3 42B A3B 2507 Thinking Abliterated Uncensored TOTAL RECALL v2 Medium MASTER CODER | Qwen1.5 72B Chat |
| Parameters | 42.4B | 72.3B |
| Context | 262K | 33K |
| Architecture | Qwen3MoeForCausalLM | Qwen2ForCausalLM |
| License | Apache 2.0 | Other |
| Downloads | 1.9K | 10.1K |
| Released | Aug 2025 | |

VRAM by Quantization: Qwen3 42B A3B 2507 Thinking Abliterated Uncensored TOTAL RECALL v2 Medium MASTER CODER vs Qwen1.5 72B Chat

| Quantization | Bits | Qwen3 42B A3B 2507 Thinking Abliterated Uncensored TOTAL RECALL v2 Medium MASTER CODER VRAM | Qwen1.5 72B Chat VRAM |
| Q2_K | 3.40 | | 36.4 GB |
| Q3_K_M | 3.90 | | 40.9 GB |
| Q3_K_S | 3.50 | | 37.3 GB |
| Q4_K_M | 4.80 | | 49.0 GB |
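
As a rough cross-check on the figures in this section, the size of the quantized weights can be approximated as parameter count times bits per weight, divided by 8. The sketch below applies that rule of thumb to the parameter counts and bits-per-weight values listed on this page. It is a weights-only lower bound under that assumption: it does not model the KV cache, activations, or runtime buffers, so real VRAM use will be higher. The function name `weights_gb` and the shortened model labels are illustrative, not part of any tool.

```python
# Bits-per-weight values copied from the quantization table on this page.
BITS_PER_WEIGHT = {"Q2_K": 3.40, "Q3_K_S": 3.50, "Q3_K_M": 3.90, "Q4_K_M": 4.80}

# Parameter counts in billions, from the Specifications section.
# The labels are shortened for display (an illustrative choice).
PARAMS_BILLION = {"Qwen3 42B A3B (DavidAU)": 42.4, "Qwen1.5 72B Chat": 72.3}


def weights_gb(params_billion, bits_per_weight):
    """Weights-only size in GB (1 GB = 10^9 bytes).

    Assumption: every parameter is stored at the average bit width.
    KV cache and runtime buffers are NOT included.
    """
    return params_billion * bits_per_weight / 8.0


if __name__ == "__main__":
    for model, params in PARAMS_BILLION.items():
        for quant, bits in BITS_PER_WEIGHT.items():
            print(f"{model:26s} {quant:7s} {weights_gb(params, bits):5.1f} GB")
```

Because the estimate scales linearly with both inputs, the 72.3B model needs roughly 1.7 times the weight memory of the 42.4B model at any given quantization level.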

Verdict

Qwen3 42B A3B 2507 Thinking Abliterated Uncensored TOTAL RECALL v2 Medium MASTER CODER supports the longer context window (262K tokens versus 33K). Qwen1.5 72B Chat is the more widely downloaded of the two (10.1K downloads versus 1.9K).

Frequently Asked Questions

Which has a longer context window, Qwen3 42B A3B 2507 Thinking Abliterated Uncensored TOTAL RECALL v2 Medium MASTER CODER or Qwen1.5 72B Chat?

Qwen3 42B A3B 2507 Thinking Abliterated Uncensored TOTAL RECALL v2 Medium MASTER CODER has the longer context window: it supports 262,144 tokens, eight times the 32,768 tokens supported by Qwen1.5 72B Chat.

What is the difference between Qwen3 42B A3B 2507 Thinking Abliterated Uncensored TOTAL RECALL v2 Medium MASTER CODER and Qwen1.5 72B Chat?

Qwen3 42B A3B 2507 Thinking Abliterated Uncensored TOTAL RECALL v2 Medium MASTER CODER is a 42.4B model from DavidAU (Qwen family), while Qwen1.5 72B Chat is a 72.3B model from Alibaba (Qwen family). Compare their VRAM requirements above to see which fits your GPU or Mac.
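
The "which fits" check can be sketched as a simple filter over the VRAM figures listed in the quantization table. The helper name `fitting_quants` and the example budgets are hypothetical, and the sketch assumes the listed figure is the total memory needed, with no extra headroom for long contexts.

```python
# VRAM figures (GB) as listed in the quantization table on this page.
LISTED_VRAM_GB = {"Q2_K": 36.4, "Q3_K_M": 40.9, "Q3_K_S": 37.3, "Q4_K_M": 49.0}


def fitting_quants(budget_gb, listed=LISTED_VRAM_GB):
    """Return quantization names whose listed VRAM fits the budget,
    ordered from smallest to largest memory requirement."""
    fits = [(vram, quant) for quant, vram in listed.items() if vram <= budget_gb]
    return [quant for vram, quant in sorted(fits)]


if __name__ == "__main__":
    # Example budgets are illustrative: a 24 GB card and a 48 GB card.
    for budget in (24.0, 48.0):
        print(budget, fitting_quants(budget))
```

Leaving a few gigabytes of margin below the device's nominal capacity is a sensible precaution, since the operating system and other processes also consume memory.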