rosieyzh/sft_qwen15_code200_lr_1e-5_cosine_2_epochs_ckpt_10_of_10

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Feb 8, 2026Architecture:Transformer Warm

Loading preview...