llmfan46/Nex-N2-mini-ultra-uncensored-heretic
The llmfan46/Nex-N2-mini-ultra-uncensored-heretic is a 35.1 billion parameter language model developed by llmfan46, based on the nex-agi/Nex-N2-mini architecture. This model is a decensored variant created using Heretic v1.2.0 with the Magnitude-Preserving Orthogonal Ablation (MPOA) method, significantly reducing refusals by 93% while maintaining original model quality with a 0.0020 KL divergence. It is optimized for agentic workflows, coding tasks, and general reasoning, offering enhanced utility for applications requiring fewer content restrictions.
Loading preview...
Nex-N2-mini-ultra-uncensored-heretic Overview
This model, developed by llmfan46, is a 35.1 billion parameter decensored version of the nex-agi/Nex-N2-mini. It was created using the Heretic v1.2.0 framework and the Magnitude-Preserving Orthogonal Ablation (MPOA) method to specifically address content refusal rates.
Key Differentiators & Performance
- Decensored Output: Achieves a significant 93% reduction in refusals (5/100 compared to 74/100 for the original model), making it suitable for use cases requiring less content restriction.
- Quality Preservation: Maintains high fidelity to the original Nex-N2-mini, evidenced by a low KL divergence of 0.0020.
- Agentic Capabilities: The base Nex-N2 architecture is designed for "Agentic Thinking," unifying reasoning, tool use, and environment execution through Adaptive and Coherent Thinking frameworks.
- Coding & General Reasoning: Excels in agentic workflows, coding tasks, and general reasoning, with the original Nex-N2-mini showing strong performance across benchmarks like BrowseComp (74.1), Terminal-Bench 2.1 (60.7), and GPQA Diamond (82.6).
- Function Calling: Supports robust function-calling capabilities, configurable during server launch.
Targeted Components for Decensoring
The MPOA method specifically targeted and modified the attn.o_proj, attn.out_proj, and mlp.down_proj components to achieve the reduction in refusals.
Usage Recommendations
For optimal performance, it is recommended to use the customized sglang fork provided by Nex-AGI for deployment. Suggested sampling parameters include temperature: 0.7, top_p: 0.95, and top_k: 40.