hjsh/Qwen2.5-Math-1.5B_grpo_ppl_adv_rollout_8_20260509_232555_step580
TEXT GENERATION
Concurrency Cost: 1
Model Size: 1.5B
Quant: BF16
Ctx Length: 32k
Tool Calling: Supported
Published: May 11, 2026
Architecture: Transformer
Cold