TeichAI/Qwen3-4B-Instruct-2507-Polaris-Alpha-Distill

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Nov 13, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

TeichAI/Qwen3-4B-Instruct-2507-Polaris-Alpha-Distill is a 4 billion parameter Qwen3-based instruction-tuned language model developed by TeichAI. It was fine-tuned using Unsloth and Huggingface's TRL library on 1,000 examples from Polaris Alpha, an early snapshot of GPT-5.1. This model is specifically designed as a non-reasoning model, focusing on direct instruction following rather than complex logical inference. It offers a 40960 token context length, making it suitable for tasks requiring extensive input processing.

Loading preview...

Model Overview

TeichAI/Qwen3-4B-Instruct-2507-Polaris-Alpha-Distill is a 4 billion parameter instruction-tuned model based on the Qwen3 architecture, developed by TeichAI. It leverages the Unsloth library for accelerated training and Huggingface's TRL library for fine-tuning. A key characteristic of this model is its training on 1,000 examples derived from Polaris Alpha, an early version of GPT-5.1, with a specific emphasis on minimal reasoning effort.

Key Capabilities

  • Instruction Following: Optimized for direct response generation based on given instructions.
  • Non-Reasoning Focus: Designed for tasks that do not require complex logical inference or deep reasoning.
  • Efficient Training: Benefits from Unsloth's accelerated training methods.
  • Extended Context: Features a 40960 token context length, allowing for processing of substantial input.

Good For

  • Applications requiring straightforward instruction execution without advanced reasoning.
  • Scenarios where a compact, efficient model with a large context window is beneficial.
  • Tasks where the model's non-reasoning characteristic aligns with the desired output style.

GGUF quantized versions of this model are also available for efficient deployment.