Name: iAmBoosted/Qwen3.5-9B-OSS-Distilled API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: iAmBoosted

Overview

iAmBoosted/Qwen3.5-9B-OSS-Distilled is a 9 billion parameter model derived from Qwen/Qwen3.5-9B, with a primary focus on improving reasoning termination. The base Qwen3.5-9B often 'spirals' on challenging prompts, failing to produce an answer. This distilled version was fine-tuned using supervised fine-tuning (LoRA) with traces from openai/gpt-oss-20b, specifically to adopt a tight, terminating reasoning style.

Key Improvements & Capabilities

Reduced 'No-Answer' Rate: The model drastically cuts the 'no-answer' rate on hard prompts from 36.2% to 0.5%, ensuring reliable output.
Improved Reasoning Style: A blind A/B judge preferred this model's answers 60.3% of the time (excluding ties) over the stock Qwen3.5-9B, indicating a cleaner and more effective reasoning process.
Text-Only Focus: While the base Qwen3.5-9B is a vision-language model, this distillation focused exclusively on text-only reasoning, and its multimodal capabilities remain untested post-fine-tuning.

Intended Use Cases

Reliable Reasoning: Ideal for applications requiring consistent and terminating reasoning in domains like math, science, code, and logic.
Behavioral Fix: This model provides a behavioral fix for the base Qwen3.5-9B's tendency to get stuck in reasoning loops.

Limitations

Style, Not Capability Upgrade: This fine-tune improves reasoning style and termination, but does not add new knowledge or increase raw problem-solving capability beyond the base model.
Untested Multimodal: Any multimodal functionality of the base model is untested and should not be relied upon in this distilled version.
Puzzle Domain: The tighter reasoning style may not be optimal for exploratory puzzle prompts, where the baseline model was sometimes preferred.

Overview

Overview

Key Improvements & Capabilities

Intended Use Cases

Limitations

Full Model Card (README)