hjsh/qwen2.5_math_1.5b_grpo_prob_adv_scaled_ratio_w_o_kl_step300

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:May 16, 2026Architecture:Transformer Warm

Loading preview...