Name: llmfan46/Nex-N2-mini-ultra-uncensored-heretic API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: llmfan46

Nex-N2-mini-ultra-uncensored-heretic Overview

This model, developed by llmfan46, is a 35.1 billion parameter decensored version of the nex-agi/Nex-N2-mini. It was created using the Heretic v1.2.0 framework and the Magnitude-Preserving Orthogonal Ablation (MPOA) method to specifically address content refusal rates.

Key Differentiators & Performance

Decensored Output: Achieves a significant 93% reduction in refusals (5/100 compared to 74/100 for the original model), making it suitable for use cases requiring less content restriction.
Quality Preservation: Maintains high fidelity to the original Nex-N2-mini, evidenced by a low KL divergence of 0.0020.
Agentic Capabilities: The base Nex-N2 architecture is designed for "Agentic Thinking," unifying reasoning, tool use, and environment execution through Adaptive and Coherent Thinking frameworks.
Coding & General Reasoning: Excels in agentic workflows, coding tasks, and general reasoning, with the original Nex-N2-mini showing strong performance across benchmarks like BrowseComp (74.1), Terminal-Bench 2.1 (60.7), and GPQA Diamond (82.6).
Function Calling: Supports robust function-calling capabilities, configurable during server launch.

Targeted Components for Decensoring

The MPOA method specifically targeted and modified the attn.o_proj, attn.out_proj, and mlp.down_proj components to achieve the reduction in refusals.

Usage Recommendations

For optimal performance, it is recommended to use the customized sglang fork provided by Nex-AGI for deployment. Suggested sampling parameters include temperature: 0.7, top_p: 0.95, and top_k: 40.

Overview

Nex-N2-mini-ultra-uncensored-heretic Overview

Key Differentiators & Performance

Targeted Components for Decensoring

Usage Recommendations

Full Model Card (README)