rosieyzh/sft_qwen15_code200_lr_5e-6_constant_bsz_64_ckpt_4_of_5
Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Jan 22, 2026Architecture:Transformer Warm

Loading preview...