mmmk12/day1-train-model
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Mar 25, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The mmmk12/day1-train-model is a Qwen2-based instruction-tuned language model, developed by mmmk12. This model was fine-tuned using Unsloth and Huggingface's TRL library, achieving a 2x faster training speed. It is derived from unsloth/Qwen2.5-0.5B-Instruct-unsloth-bnb-4bit, making it suitable for efficient deployment and tasks benefiting from optimized training processes.

Loading preview...