spinochenza/Nex-N2-mini-ultra-uncensored-heretic
The spinochenza/Nex-N2-mini-ultra-uncensored-heretic is a 35.1 billion parameter language model based on the nex-agi/Nex-N2-mini architecture, which itself is built on the Qwen3.5 series. This model has been decensored using the Heretic v1.2.0 framework with a Magnitude-Preserving Orthogonal Ablation (MPOA) method, resulting in 93% fewer refusals (5/100) compared to the original while maintaining a low KL divergence of 0.0020. It is specifically optimized for reduced content restrictions, making it suitable for applications requiring less filtered responses.
Loading preview...
Model Overview
This model, spinochenza/Nex-N2-mini-ultra-uncensored-heretic, is a 35.1 billion parameter variant of the nex-agi/Nex-N2-mini model, which is part of the Nex-N2 family built on the Qwen3.5 series. Its primary distinction is its decensored nature, achieved through the application of the Heretic v1.2.0 framework using a variant of the Magnitude-Preserving Orthogonal Ablation (MPOA) method.
Key Differentiators
- Significantly Reduced Refusals: Achieves a remarkable 93% reduction in refusals, with only 5 out of 100 test cases resulting in a refusal, compared to 74 out of 100 for the original
Nex-N2-mini. This indicates a much lower propensity for content restrictions, objections, or censorship. - Preserved Quality: Despite the decensoring process, the model maintains high fidelity to the original's baseline performance, evidenced by a very low KL divergence of 0.0020.
- Targeted Ablation: The decensoring process specifically targeted components like
attn.o_proj,attn.out_proj, andmlp.down_projto achieve its uncensored behavior.
Ideal Use Cases
- Applications requiring less filtered or unrestricted text generation.
- Scenarios where the original
Nex-N2-mini's refusal rate was a limiting factor. - Research into model safety, bias, and control mechanisms.
For local deployment, nex-agi recommends using their customized sglang fork, with specific configurations for Nex-N2-mini including qwen3 reasoning and qwen3_coder tool-call parsers. GGUF quantizations are also available for broader compatibility.