Name: coder3101/Qwen3.5-4B-heretic API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: coder3101

Overview

This model, coder3101/Qwen3.5-4B-heretic, is a 4.5 billion parameter decensored variant of the original Qwen/Qwen3.5-4B, created using the Heretic v1.2.0 framework. Its primary distinction lies in its significantly reduced refusal rate, dropping from 94/100 in the base model to just 4/100, achieved through multi-directional refusal suppression. This makes it suitable for use cases where content filtering is undesirable or needs to be bypassed.

Key Capabilities

Decensored Output: Offers a less restrictive content generation experience compared to its base model.
Unified Vision-Language Foundation: Inherits the Qwen3.5's multimodal capabilities, supporting both image and video inputs.
Efficient Hybrid Architecture: Features Gated Delta Networks and sparse Mixture-of-Experts for high-throughput inference.
Scalable RL Generalization: Benefits from reinforcement learning across million-agent environments for robust real-world adaptability.
Extensive Context Length: Natively supports 262,144 tokens, extensible up to 1,010,000 tokens with YaRN scaling.
Tool Calling: Excels in agentic usage and tool integration, as demonstrated by its performance in TIR-Bench and V* benchmarks.

Ideal Use Cases

Unfiltered Content Generation: For applications requiring responses without inherent refusal mechanisms.
Multimodal AI: Leveraging its vision and video understanding for complex tasks.
Agentic Workflows: Building AI agents that can effectively use tools and interact with environments.
Long Context Processing: Handling extensive documents or conversations with its large context window.

Overview

Overview

Key Capabilities

Ideal Use Cases

Full Model Card (README)