DreamFast/qwen3-8b-heretic

Text generation · Model size: 8B · Quant: FP8 · Context length: 32k · Concurrency cost: 1 · Published: Mar 20, 2026 · License: apache-2.0 · Architecture: Transformer

DreamFast/qwen3-8b-heretic is an 8 billion parameter Qwen3-based language model, created by DreamFast using the Heretic abliteration method. This model features significantly reduced refusal rates (13/100 vs 100/100 for the base model) while maintaining model quality, making it highly suitable as an uncensored text encoder for image generation models like Klein 9B. It offers a 32768 token context length and is available in various formats including HuggingFace, ComfyUI (with NVFP4 quantization for Blackwell GPUs), and GGUF.


Overview

DreamFast/qwen3-8b-heretic is an 8 billion parameter language model derived from Qwen's Qwen3-8B, processed with the Heretic v1.2.0 abliteration tool. This modification reduces model refusals from 100/100 to 13/100, making the model more permissive while preserving the base model's quality, as indicated by a low KL divergence of 0.0838 between the two models' output distributions.
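The KL divergence figure quantifies how far the abliterated model's output distribution drifts from the base model's (0 means identical outputs). A minimal sketch of the computation, using small made-up next-token distributions rather than actual model outputs:

```python
import math

def kl_divergence(p, q):
    """KL(P || Q) for discrete distributions given as probability lists.
    Returns 0 when the distributions are identical; grows as they diverge."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical next-token distributions over a tiny 4-token vocabulary:
base_model  = [0.70, 0.20, 0.07, 0.03]
abliterated = [0.66, 0.23, 0.08, 0.03]

print(f"{kl_divergence(base_model, abliterated):.4f}")  # small value => outputs stay close
```

A per-token divergence averaged over many prompts is what a figure like 0.0838 summarizes; the lower it is, the less the abliteration disturbed the base model's behavior.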

Key Capabilities & Features

  • Reduced Refusals: Achieves an 87% reduction in refusal rates compared to the base Qwen3-8B model.
  • High Quality: The abliteration process preserves model integrity, with minimal damage as measured by KL divergence against the base model.
  • Optimized for Image Generation: Designed to function effectively as an uncensored text encoder, particularly for models like Klein 9B.
  • Flexible Formats: Provided in HuggingFace, ComfyUI (bf16, FP8, NVFP4), and GGUF (various quantizations including Q4_K_M recommended) formats.
  • NVFP4 Quantization: Offers highly efficient NVFP4 (4-bit floating point) variants, ideal for Blackwell GPUs (RTX 5090/5080) with native FP4 tensor cores, and supported on older GPUs via software dequantization.
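The rough weight-memory arithmetic behind these format choices is 8B parameters × bits per weight. A sketch, noting that the ~4.8 bits/weight for Q4_K_M is an estimate (actual GGUF sizes vary with the per-tensor quant mix) and that KV cache and runtime overhead are not included:

```python
PARAMS = 8e9  # 8 billion parameters

def weight_gib(bits_per_weight: float, params: float = PARAMS) -> float:
    """Approximate GiB needed for model weights alone (excludes KV cache,
    activations, and framework overhead)."""
    return params * bits_per_weight / 8 / 2**30

# Effective bits per weight for each published format; Q4_K_M's ~4.8 is an
# estimate, since mixed-precision GGUF quants have no single fixed width.
for fmt, bits in {"bf16": 16, "FP8": 8, "Q4_K_M": 4.8, "NVFP4": 4}.items():
    print(f"{fmt:>7}: ~{weight_gib(bits):.1f} GiB of weights")
```

By this estimate the NVFP4 variant needs roughly a quarter of the bf16 weight footprint (~3.7 GiB vs ~14.9 GiB), which is why the 4-bit formats suit low-VRAM setups.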

Recommended Use Cases

  • Uncensored Text Encoding: Ideal for applications requiring a less restrictive text encoder, especially in conjunction with image generation models.
  • Creative Content Generation: Suitable for scenarios where the base model's refusal behavior might hinder creative or open-ended text generation.
  • Resource-Efficient Deployment: GGUF and NVFP4 formats enable deployment on systems with varying hardware constraints, from high-end GPUs to low VRAM setups.