Jackrong/Korean-Qwen3-4B-Thinking-2507-sft

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 26, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The Jackrong/Korean-Qwen3-4B-Thinking-2507-sft is a 4 billion parameter Qwen3 model developed by Jackrong, fine-tuned from unsloth/qwen3-4b-thinking-2507-unsloth-bnb-4bit. This model was trained using Unsloth and Huggingface's TRL library, achieving 2x faster training. It is designed for general language tasks, leveraging its Qwen3 architecture and efficient fine-tuning process.

Loading preview...

Model Overview

This model, developed by Jackrong, is a 4 billion parameter Qwen3-based language model. It has been fine-tuned from the unsloth/qwen3-4b-thinking-2507-unsloth-bnb-4bit base model, indicating a focus on efficient training and potentially optimized performance within its parameter class.

Key Characteristics

  • Architecture: Based on the Qwen3 model family.
  • Parameter Count: 4 billion parameters, offering a balance between performance and computational efficiency.
  • Training Efficiency: Fine-tuned using Unsloth and Huggingface's TRL library, resulting in a 2x faster training process compared to standard methods.
  • License: Distributed under the Apache-2.0 license, allowing for broad use and modification.

Potential Use Cases

Given its efficient training and Qwen3 foundation, this model is suitable for:

  • General text generation and understanding tasks.
  • Applications where faster fine-tuning is a critical requirement.
  • Scenarios requiring a capable language model within a 4B parameter budget.