Norrawee/Qwen3-4B-Thinking-2507-exp04

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Jan 12, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

Norrawee/Qwen3-4B-Thinking-2507-exp04 is a 4 billion parameter Qwen3 model developed by Norrawee, fine-tuned from Norrawee/Qwen3-4B-Thinking-2507-exp02. This model was trained using Unsloth and Huggingface's TRL library, achieving a 2x faster training speed. It is designed for general language understanding and generation tasks, offering efficient performance for its size.

Loading preview...

Norrawee/Qwen3-4B-Thinking-2507-exp04 Overview

This model is a 4 billion parameter Qwen3 variant, developed by Norrawee and fine-tuned from a previous iteration, Norrawee/Qwen3-4B-Thinking-2507-exp02. It stands out due to its optimized training process, leveraging the Unsloth library in conjunction with Huggingface's TRL library, which enabled a 2x faster training speed compared to conventional methods.

Key Capabilities

  • Efficient Training: Benefits from Unsloth's optimizations for significantly faster fine-tuning.
  • General Language Tasks: Suitable for a broad range of natural language understanding and generation applications.
  • Qwen3 Architecture: Built upon the robust Qwen3 model family, providing a solid foundation for performance.

Good for

  • Developers seeking a moderately sized language model (4B parameters) with an efficient training lineage.
  • Applications requiring a balance of performance and computational resource efficiency.
  • Experimentation with models fine-tuned using advanced training acceleration techniques like Unsloth.