rosieyzh/sft_qwen15_code200_lr_1e-5_cosine_bsz_128_ckpt_4_of_5

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Jan 22, 2026Architecture:Transformer Warm

Loading preview...