kairawal/Qwen3-4B-ZH-SynthDolly-r16alpha32-E8-S73

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 18, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The kairawal/Qwen3-4B-ZH-SynthDolly-r16alpha32-E8-S73 is a 4 billion parameter Qwen3-based language model developed by kairawal, fine-tuned from unsloth/qwen3-4b. This model was optimized for faster training using Unsloth and Huggingface's TRL library, offering a 32768-token context window. It is designed for general language tasks, leveraging its efficient training methodology.

Loading preview...

Model Overview

The kairawal/Qwen3-4B-ZH-SynthDolly-r16alpha32-E8-S73 is a 4 billion parameter language model based on the Qwen3 architecture, developed by kairawal. It was fine-tuned from the unsloth/qwen3-4b base model and features a substantial context length of 32768 tokens.

Key Characteristics

  • Efficient Training: This model was trained significantly faster (2x) by leveraging the Unsloth library in conjunction with Huggingface's TRL library. This indicates an optimization for resource-efficient fine-tuning.
  • Qwen3 Architecture: Built upon the Qwen3 foundation, it inherits the general capabilities of this model family.
  • Context Window: Supports a 32768-token context, allowing for processing and generating longer sequences of text.

Use Cases

This model is suitable for applications requiring a moderately sized language model with efficient training origins. Its large context window makes it potentially useful for tasks involving:

  • General text generation and understanding.
  • Applications where faster fine-tuning is a critical factor.
  • Processing longer documents or conversations.