Name: DexopT/Qwen3-4B-Cybersecurity-Heretic-16bit API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: DexopT

Overview

DexopT/Qwen3-4B-Cybersecurity-Heretic-16bit is a 4-billion-parameter Qwen3-based model developed by DexopT and engineered to reduce refusal behaviors. It is a modified version of the DexopT/Qwen3-4B-Cybersecurity base model in which refusal directions have been "abliterated" using the Heretic v1.2.0 technique.

Key Capabilities

Reduced Refusals: Achieves a 76% pass rate (38 of 50 prompts) on custom cybersecurity-specific bad/good prompt datasets, which the author reports as a significant reduction in refusal behaviors compared to the base model.
Cybersecurity Focus: Inherits its base model's fine-tuning for cybersecurity tasks, making it proficient in generating content related to topics like SQL injection, reverse shells, and privilege escalation.
Heretic Abliteration: Utilizes a novel method that identifies and subtracts the model's "refusal direction" from its residual stream, modifying model weights directly without retraining.
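
The weight edit described under Heretic Abliteration can be illustrated with a generic directional-ablation sketch. This is not Heretic's actual code: how Heretic finds the refusal direction, which matrices it modifies, and any per-layer scaling are not stated here, so the function names, the toy matrix, and the direction below are all hypothetical. The sketch assumes a weight matrix W that writes into the residual stream (output = W x) and a known direction r, and replaces W with (I - r r^T) W so that nothing W writes has a component along r.

```python
import math


def normalize(v):
    """Scale v to unit length; a zero vector has no direction to remove."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0:
        raise ValueError("direction must be non-zero")
    return [x / norm for x in v]


def ablate_direction(weight, direction):
    """Return W' = (I - r r^T) W for the unit-normalized direction r.

    weight: list of rows with shape (d_model, d_in); it writes into the
            residual stream as output = W x.
    direction: hypothetical "refusal direction" of length d_model.
    """
    r = normalize(direction)
    d_model, d_in = len(weight), len(weight[0])
    if len(r) != d_model:
        raise ValueError("direction length must match the number of rows")
    # r^T W: how strongly each column of W points along r.
    coeffs = [sum(r[i] * weight[i][j] for i in range(d_model)) for j in range(d_in)]
    # Subtract that component from every column.
    return [
        [weight[i][j] - r[i] * coeffs[j] for j in range(d_in)]
        for i in range(d_model)
    ]


def matvec(weight, x):
    """Plain matrix-vector product W x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weight]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


# Toy example (made-up numbers): a 3x2 matrix and a direction along axis 0.
W = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
r = [1.0, 0.0, 0.0]
W_ablated = ablate_direction(W, r)

# Whatever input is fed in, the edited matrix writes nothing along r.
out = matvec(W_ablated, [0.5, -2.0])
component_along_r = dot(out, normalize(r))
```

Because the projection is folded into the weights once, no extra computation is needed at inference time, which matches the description of modifying weights directly without retraining.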

Intended Use

This model is primarily intended for educational and research purposes within the cybersecurity domain. Its modified behavior makes it suitable for exploring scenarios where typical safety alignments might hinder specific technical inquiries. Users should exercise caution and responsibility, as the abliteration process reduces, but does not entirely eliminate, safety behaviors.

Full Model Card (README)