Models
8,759
LorenaYannnnnColdTools800M32K
20260314-Skywork_qwen_0.6B-Qwen3-0.6B_grpo_baseline_192000_episodes_seed_42
0
·12
·Mar 2026

Kazuki1450ColdTools2B32K
Qwen3-1.7B-Base_dsum_3_6_tok_python_alt_1_per_2_1p0_0p0_1p0_grpo_42_rule
0
·12
·Mar 2026

Kazuki1450ColdTools2B32K
Qwen3-1.7B-Base_dsum_3_6_rel_1e0_1p0_0p0_1p0_grpo_dr_grpo_42_rule
0
·12
·Mar 2026

Kazuki1450ColdTools2B32K
Qwen3-1.7B-Base_dsum_3_6_tok_python_1p0_0p0_1p0_grpo_dr_grpo_42_rule
0
·12
·Mar 2026

ccui46ColdTools8B32K
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_2500
0
·12
·Apr 2026

