Name: avalon2244/Qwen3.5-4B-Claude-Opus-4.6-Distilled API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: avalon2244

Model Overview

avalon2244/Qwen3.5-4B-Claude-Opus-4.6-Distilled is a 4.5 billion parameter language model, derived from a finetune of the Qwen3.5-4B architecture. This model was specifically trained on the nohurry/Opus-4.6-Reasoning-3000x-filtered dataset, with the primary goal of enhancing its reasoning capabilities and producing cleaner reasoning traces compared to the original Qwen3.5-4B.

Key Characteristics

Improved Reasoning: The finetuning process aimed to provide more coherent and structured reasoning outputs.
Reduced Looping: It is noted to be significantly less prone to generating repetitive or endless loops in its responses.
Token Efficiency: The model tends to use fewer tokens for its outputs, which can be beneficial for inference costs and speed.
Training Methodology: Finetuned and converted to GGUF format using Unsloth, indicating an optimized training and conversion process.

Intended Use Cases

This model is suitable for applications requiring:

Reasoning Tasks: Where a clearer and more consistent reasoning process is desired from a smaller model.
Resource-Constrained Environments: Its 4.5B parameter size and token efficiency make it a candidate for deployment where computational resources are limited.

It's important to note that while improvements in reasoning consistency are observed, it remains a 4B model and its performance should be evaluated against specific use case requirements.

Overview

Model Overview

Key Characteristics

Intended Use Cases

Full Model Card (README)