heafy99/Qwen3-4B-Instruct-2507-heretic
heafy99/Qwen3-4B-Instruct-2507-heretic is a 4 billion parameter instruction-tuned causal language model, based on the Qwen3-4B-Instruct-2507 architecture developed by Qwen. This model is a decensored version, created using Heretic v1.3.0, specifically designed to reduce refusals and enhance open-ended responses. It features a native context length of 262,144 tokens and excels in general capabilities including instruction following, logical reasoning, and agentic tool usage.
Loading preview...
What the fuck is this model about?
This model, heafy99/Qwen3-4B-Instruct-2507-heretic, is a decensored version of the Qwen3-4B-Instruct-2507 large language model, created by applying the Heretic v1.3.0 tool. It retains the core capabilities of the original Qwen3-4B-Instruct-2507, which is a 4 billion parameter instruction-tuned causal language model developed by Qwen. A key feature is its native 262,144 token context length, enabling extensive long-context understanding.
What makes THIS different from all the other models?
The primary differentiator is its decensored nature. While the original Qwen3-4B-Instruct-2507 had a refusal rate of 100/100, this 'heretic' version significantly reduces refusals to 7/100, as measured by KL divergence of 0.0497. This makes it suitable for use cases requiring less restrictive content generation. It also boasts reproducibility through documented abliteration parameters.
Should I use this for my use case?
This model is particularly well-suited for applications where:
- Reduced content moderation or decensored outputs are desired.
- Long-context understanding is critical, thanks to its 262K native context window.
- You need strong performance in instruction following, logical reasoning, text comprehension, mathematics, science, coding, and tool usage.
- Agentic capabilities are important, as it excels in tool calling and integrates with Qwen-Agent.
Consider its use if you require a powerful 4B parameter model with enhanced flexibility in content generation compared to its more restrictive base model.