Name: 0xA50C1A1/Qwen3-4B-Instruct-2507-Heretic API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: 0xA50C1A1

Overview

This model, 0xA50C1A1/Qwen3-4B-Instruct-2507-Heretic, is a 4.0 billion parameter instruction-tuned causal language model derived from the Qwen3-4B-Instruct-2507 base model by Qwen. It features a substantial native context length of 262,144 tokens, enabling extensive long-context understanding. A key differentiator is its "decensored" nature, achieved using the Heretic v1.2.0 tool, which significantly reduces content refusals from 100/100 in the original model to 5/100, as measured by KL divergence.

Key Capabilities

Enhanced Instruction Following & Reasoning: Demonstrates significant improvements in general capabilities, including logical reasoning, text comprehension, mathematics, science, and coding.
Extended Context Understanding: Natively supports a context length of 262,144 tokens, ideal for processing and generating very long texts.
Reduced Refusals: Modified to be less restrictive in content generation, offering more helpful and open-ended responses compared to its highly aligned predecessor.
Multilingual & Knowledge Coverage: Shows substantial gains in long-tail knowledge coverage across multiple languages.
Agentic Use: Excels in tool-calling capabilities, with recommended integration via Qwen-Agent for complex agentic workflows.

Performance Highlights

The model shows strong performance across various benchmarks, often outperforming its base model and other comparably sized models:

Knowledge: Achieves 69.6 on MMLU-Pro and 62.0 on GPQA.
Reasoning: Scores 47.4 on AIME25 and 80.2 on ZebraLogic.
Coding: Reaches 35.1 on LiveCodeBench v6 and 76.8 on MultiPL-E.
Alignment: Scores 83.5 on Creative Writing v3 and 83.4 on WritingBench, indicating strong user preference alignment in subjective tasks.

Good For

Applications requiring less restrictive content generation or exploration of diverse topics.
Tasks demanding deep understanding of very long documents or conversations.
Use cases involving complex instruction following, logical reasoning, and mathematical problem-solving.
Code generation and agentic workflows where tool-calling is essential.

Overview

Overview

Key Capabilities

Performance Highlights

Good For

Full Model Card (README)