kick1127/day1-train-model
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Mar 25, 2026License:apache-2.0Architecture:Transformer Open Weights Warm
The kick1127/day1-train-model is a 0.5 billion parameter Qwen2.5-Instruct model, finetuned by kick1127. This model was trained 2x faster using Unsloth and Huggingface's TRL library, offering efficient performance for its size. With a context length of 32768 tokens, it is suitable for tasks requiring a balance of speed and context understanding.
Loading preview...