avalon2244/Qwen3.5-4B-Claude-Opus-4.6-Distilled

VISIONConcurrency Cost:1Model Size:4.5BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Mar 4, 2026Architecture:Transformer0.0K Cold

avalon2244/Qwen3.5-4B-Claude-Opus-4.6-Distilled is a 4.5 billion parameter language model, a finetune of Qwen3.5-4B. It was trained on the nohurry/Opus-4.6-Reasoning-3000x-filtered dataset, aiming for cleaner reasoning traces compared to its base model. This model demonstrates reduced likelihood of entering endless loops and uses fewer tokens for responses. While not extensively tested, it is designed for improved reasoning consistency within its 4B parameter class.

Loading preview...

Model Overview

avalon2244/Qwen3.5-4B-Claude-Opus-4.6-Distilled is a 4.5 billion parameter language model, derived from a finetune of the Qwen3.5-4B architecture. This model was specifically trained on the nohurry/Opus-4.6-Reasoning-3000x-filtered dataset, with the primary goal of enhancing its reasoning capabilities and producing cleaner reasoning traces compared to the original Qwen3.5-4B.

Key Characteristics

  • Improved Reasoning: The finetuning process aimed to provide more coherent and structured reasoning outputs.
  • Reduced Looping: It is noted to be significantly less prone to generating repetitive or endless loops in its responses.
  • Token Efficiency: The model tends to use fewer tokens for its outputs, which can be beneficial for inference costs and speed.
  • Training Methodology: Finetuned and converted to GGUF format using Unsloth, indicating an optimized training and conversion process.

Intended Use Cases

This model is suitable for applications requiring:

  • Reasoning Tasks: Where a clearer and more consistent reasoning process is desired from a smaller model.
  • Resource-Constrained Environments: Its 4.5B parameter size and token efficiency make it a candidate for deployment where computational resources are limited.

It's important to note that while improvements in reasoning consistency are observed, it remains a 4B model and its performance should be evaluated against specific use case requirements.