koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_para
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Jan 16, 2026License:apache-2.0Architecture:Transformer Open Weights Warm
The koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_para model is a Qwen3-based instruction-tuned language model, developed by koutch. This model was finetuned from unsloth/qwen3-4b-instruct-2507-unsloth-bnb-4bit, leveraging Unsloth and Huggingface's TRL library for accelerated training. Its primary differentiator is the optimized training process, achieving 2x faster finetuning, making it suitable for applications requiring efficient deployment of Qwen3-based models.
Loading preview...