sparr250/day1-train-model
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Apr 1, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The sparr250/day1-train-model is a 0.5 billion parameter Qwen2.5-based instruction-tuned causal language model. Developed by sparr250, it was fine-tuned using Unsloth and Huggingface's TRL library, achieving 2x faster training. This model is optimized for efficient performance in tasks typically handled by smaller instruction-following models, leveraging its accelerated training methodology.

Loading preview...