Models
4,756
Kazuki1450 · Cold · Tools · 2B · 32K
Qwen3-1.7B-Base_csum_6_10_geq_8_geq_8_0p25_0p75_1p0_0p0_1p0_grpo_42_rule
0
·1
·Jan 2026

Kazuki1450 · Cold · Tools · 2B · 32K
Qwen3-1.7B-Base_csum_6_10_geq_8_geq_8_0p25_0p50_1p0_0p0_1p0_grpo_42_rule
0
·1
·Jan 2026

Kazuki1450 · Cold · Tools · 2B · 32K
Qwen3-1.7B-Base_csum_6_10_geq_8_geq_8_0p5_0p75_1p0_0p0_1p0_grpo_42_rule
0
·1
·Jan 2026

Kazuki1450 · Cold · Tools · 2B · 32K
Qwen3-1.7B-Base_csum_6_10_geq_8_geq_8_0p5_1p0_1p0_0p0_1p0_grpo_42_rule
0
·1
·Jan 2026

Kazuki1450 · Cold · Tools · 2B · 32K
Qwen3-1.7B-Base_csum_6_10_geq_8_geq_8_1p0_0p75_1p0_0p0_1p0_grpo_42_rule
0
·1
·Jan 2026

Kazuki1450 · Cold · Tools · 2B · 32K
Qwen3-1.7B-Base_csum_6_10_tok_aligned_1p0_0p0_1p0_grpo_42_rule
0
·1
·Jan 2026

MultiRL · Cold · Tools · 2B · 32K
qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch1
0
·1
·Mar 2026

MultiRL · Cold · Tools · 2B · 32K
qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch2
0
·1
·Mar 2026

MultiRL · Cold · Tools · 2B · 32K
qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch3
0
·1
·Mar 2026

choiqs · Cold · Tools · 2B · 32K
Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint375
0
·1
·Apr 2026
