YanLabs/Qwen3-4B-Thinking-2507-MPOA

Text generation · Model size: 4B · Quantization: BF16 · Context length: 32k · Published: Dec 21, 2025 · License: apache-2.0 · Architecture: Transformer

YanLabs/Qwen3-4B-Thinking-2507-MPOA is a 4-billion-parameter causal language model developed by YanLabs and derived from Qwen/Qwen3-4B-Thinking-2507. The model has undergone norm-preserving biprojected abliteration to remove safety guardrails and refusal mechanisms while aiming to retain its original capabilities. It is intended primarily for mechanistic interpretability research and analysis of LLM safety mechanisms; a sampling temperature of 1.05 is recommended.


YanLabs/Qwen3-4B-Thinking-2507-MPOA Overview

This model, developed by YanLabs, is a 4 billion parameter causal language model based on Qwen/Qwen3-4B-Thinking-2507. Its key differentiator is the application of norm-preserving biprojected abliteration, a technique that surgically removes "refusal directions" from the model's activation space. This process is designed to eliminate safety guardrails and refusal mechanisms without traditional fine-tuning, while preserving the model's original capabilities.
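To make the idea concrete, here is a minimal numpy sketch of directional ablation with a norm-restoring step: each weight column has its component along a "refusal direction" projected out, then is rescaled to its original norm. This is an illustrative simplification; the exact "biprojected" variant used by YanLabs is not specified in this card, and the direction `r` here is a stand-in for an empirically extracted refusal direction.

```python
import numpy as np

def abliterate_norm_preserving(W: np.ndarray, r: np.ndarray) -> np.ndarray:
    """Remove the component along direction `r` from each column of `W`,
    then rescale each column back to its original norm.

    W: (d_model, d_in) weight matrix writing into the residual stream.
    r: candidate "refusal direction" in d_model space (hypothetical).
    """
    r = r / np.linalg.norm(r)                  # unit refusal direction
    orig_norms = np.linalg.norm(W, axis=0)     # per-column norms before ablation
    W_abl = W - np.outer(r, r) @ W             # project r out of each column
    new_norms = np.linalg.norm(W_abl, axis=0)  # assumes no column is parallel to r
    return W_abl * (orig_norms / new_norms)    # restore original column norms
```

Because rescaling a column multiplies it by a scalar, the result stays orthogonal to the refusal direction while the weight norms (and thus, roughly, activation magnitudes) are preserved; this is the intuition behind the "norm-preserving" label.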

Key Characteristics

  • Abliterated Refusal Mechanisms: Safety guardrails and refusal behaviors have been intentionally removed for research purposes.
  • Norm-Preserving: The abliteration technique aims to maintain the model's original performance and capabilities.
  • Research-Focused: Specifically designed for mechanistic interpretability studies and understanding LLM safety.

Good for

  • Mechanistic Interpretability Research: Studying how LLMs function internally.
  • LLM Safety Analysis: Investigating the nature and removal of safety mechanisms.
  • Abliteration Technique Development: Testing and refining methods for modifying model behaviors.

⚠️ Warning: Because its safety mechanisms have been removed, this model may generate harmful or unsafe content and is not suitable for production deployments or user-facing applications. A sampling temperature of 1.05 is recommended.
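For researchers who do load the model, the setup might look like the following sketch using the Hugging Face `transformers` API. Only the model ID and the temperature of 1.05 come from this card; the remaining generation parameters and the helper function are illustrative assumptions, not an official quickstart.

```python
model_id = "YanLabs/Qwen3-4B-Thinking-2507-MPOA"

# Only temperature=1.05 is stated by the card; other values are
# illustrative assumptions.
gen_kwargs = {"do_sample": True, "temperature": 1.05, "max_new_tokens": 1024}

def generate(prompt: str) -> str:
    # Imports are kept local so the configuration above can be
    # inspected without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tok = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="bfloat16")
    inputs = tok.apply_chat_template(
        [{"role": "user", "content": prompt}],
        add_generation_prompt=True,
        return_tensors="pt",
    )
    out = model.generate(inputs, **gen_kwargs)
    # Decode only the newly generated tokens.
    return tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True)
```

The card gives BF16 as the quantization, hence the `torch_dtype="bfloat16"` choice above.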