Models
8,705
Kazuki1450 · Warm · Tools · 2B · 32K
Qwen3-1.7B-Base_dsum_3_6_tok_python_alt_1_per_5_1p0_0p0_1p0_grpo_42_rule
0 · 4 · Mar 2026

CL-From-Nothing · Warm · Tools · 2B · 32K
teacher_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_qwen3-1.7b_epoch_3_mask
0 · 4 · Mar 2026

Kazuki1450 · Warm · Tools · 2B · 32K
Qwen3-1.7B-Base_dsum_3_6_rel_1e-1_1p0_0p0_1p0_grpo_dr_grpo_42_rule
0 · 4 · Mar 2026

Kazuki1450 · Warm · Tools · 2B · 32K
Qwen3-1.7B-Base_dsum_3_6_rel_1e0_1p0_0p0_1p0_grpo_dr_grpo_42_rule
0 · 4 · Mar 2026

Kazuki1450 · Warm · Tools · 2B · 32K
Qwen3-1.7B-Base_dsum_3_6_rel_1e1_1p0_0p0_1p0_grpo_dr_grpo_42_rule
0 · 4 · Mar 2026

Kazuki1450 · Warm · Tools · 2B · 32K
Qwen3-1.7B-Base_dsum_3_6_tok_python_1p0_0p0_1p0_grpo_dr_grpo_42_rule
0 · 4 · Mar 2026

Kazuki1450 · Warm · Tools · 2B · 32K
Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_1p0_0p0_1p0_grpo_sapo_42_rule
0 · 4 · Mar 2026

Kazuki1450 · Warm · Tools · 2B · 32K
Qwen3-1.7B-Base_dsum_3_6_tok_python_1p0_0p0_1p0_grpo_sapo_42_rule
0 · 4 · Mar 2026
