hjsh/qwen2.5_math_1.5b_grpo_prob_adv_scaled_ratio_rollout_8_step580
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Apr 19, 2026Architecture:Transformer Cold
Loading preview...
Loading preview...