YunoAIdotcom/Qwen3-14B-RefusalDirection-ThinkingAware
TEXT GENERATIONConcurrency Cost:1Model Size:14BQuant:FP8Ctx Length:32kPublished:Jul 28, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Qwen3-14B-RefusalDirection-ThinkingAware is a 14 billion parameter research model, forked from Qwen/Qwen3-14B, designed to investigate AI safety mechanisms and their cognitive costs. This model demonstrates significantly reduced safety mechanisms, readily providing harmful content, and reveals that standard keyword-based safety evaluations underestimate bypasses by nearly 50%. It also shows a +0.6% MMLU performance gain by ablating refusal mechanisms, suggesting a cognitive cost to safety alignment, and is intended exclusively for AI safety research.

Loading preview...