Models
14,905
hjshColdTools2B32K
Qwen2.5-Math-1.5B_grpo_entropy_rollout_8_ent_0.0008_20260509_232920_step580
0 · 3 · May 2026

hjshColdTools2B32K
Qwen2.5-Math-1.5B_grpo_entropy_rollout_8_ent_0.003_20260509_233150_step580
0 · 3 · May 2026
