phantt1904/Qwen3-4B-giaothong-sft
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 22, 2026License:apache-2.0Architecture:Transformer Open Weights Warm
The phantt1904/Qwen3-4B-giaothong-sft is a 4 billion parameter Qwen3 model, fine-tuned by phantt1904. This model was efficiently trained using Unsloth and Huggingface's TRL library, achieving 2x faster training speeds. It is designed for general language tasks, leveraging its Qwen3 architecture and optimized training process. The model has a context length of 32768 tokens.
Loading preview...
Model Overview
The phantt1904/Qwen3-4B-giaothong-sft is a 4 billion parameter language model based on the Qwen3 architecture, developed by phantt1904. This model distinguishes itself through its optimized training methodology, utilizing Unsloth and Huggingface's TRL library. This approach enabled a 2x faster fine-tuning process compared to standard methods.
Key Characteristics
- Base Model: Fine-tuned from
unsloth/qwen3-4b-bnb-4bit. - Parameter Count: 4 billion parameters, offering a balance between performance and computational efficiency.
- Training Efficiency: Leverages Unsloth for significantly accelerated fine-tuning.
- Context Length: Supports a substantial context window of 32768 tokens, allowing for processing longer inputs and maintaining conversational coherence over extended interactions.
Good For
- Applications requiring a capable 4B parameter model with efficient training origins.
- General language understanding and generation tasks where the Qwen3 architecture is suitable.
- Developers interested in models fine-tuned with advanced techniques like Unsloth for faster iteration.