Name: TeichAI/Qwen3.5-4B-Claude-Opus-Reasoning-Distill API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: TeichAI

Overview

TeichAI/Qwen3.5-4B-Claude-Opus-Reasoning-Distill is a 4.5 billion parameter model based on Qwen3.5, developed by TeichAI. It stands out due to its unique single-epoch distillation methodology, which transfers the advanced reasoning patterns and conversational style of Claude Opus 4.6 while largely preserving the base model's capabilities and avoiding catastrophic forgetting common in multi-epoch community distillations.

Key Capabilities & Differentiators

Single-Epoch Distillation: Achieves style transfer with minimal damage to foundational capabilities by training for only one epoch on highly curated data.
Premium Reasoning Data: Trained on approximately 4000 hand-curated examples from Claude Opus 4.6 and Claude Sonnet 4.6, focusing on deep analytical reasoning, multi-turn conversations, complex problem decomposition, and self-correction patterns.
Mixed Tool-Use Corpus: Includes both pure reasoning (92%) and tool-use examples (8%) (web_search, web_fetch, grep) to ensure appropriate tool invocation without over-indexing.
Improved Reasoning Benchmarks: Shows significant gains on IFEval (+17.6%), ARC Challenge (+13.3%), and Winogrande (+8.3%) compared to the base Qwen3.5-4B model.
Qualitative Improvements: Delivers more concise outputs, reduced thinking loops, deeper reasoning traces, and better conversational flow.

Tradeoffs & Considerations

Knowledge Tradeoff: Experiences a minor MMLU drop (-9.6%) and a small TruthfulQA loss (-2.7%), indicating some factual recall displacement, which is typical in style transfers.

Good for

Applications requiring strong logical reasoning and analytical thinking.
Tasks benefiting from structured, multi-step problem-solving.
Use cases where concise, purposeful generation is preferred over verbose outputs.
Scenarios needing a model that can appropriately use tools while maintaining strong reasoning.

Overview

Overview

Key Capabilities & Differentiators

Tradeoffs & Considerations

Good for

Full Model Card (README)