Models
5,769
Qwen3-1.7B-Base_csum_3_10_tok_parentheses_1p0_0p0_1p0_grpo_42_rule

Qwen3-1.7B-Base_csum_3_10_tok_python_1p0_0p0_1p0_grpo_42_rule

Qwen3-1.7B-Base_csum_3_10_sgnrel_down_1e1_1p0_0p0_1p0_grpo_42_rule

Qwen3-1.7B-Base_csum_3_10_tok_boxed_1p0_0p0_1p0_grpo_42_rule

Qwen3-1.7B-Base_csum_3_10_tok_English_1p0_0p0_1p0_grpo_42_rule

Qwen3-1.7B-Base_csum_3_10_tok_Continue_1p0_0p0_1p0_grpo_42_rule

Qwen3-1.7B-Base_csum_3_10_tok_accuracy_1p0_0p0_1p0_grpo_42_rule

Qwen3-1.7B-Base_csum_3_10_tok_formula_1p0_0p0_1p0_grpo_42_rule

Qwen3-1.7B-Base_csum_3_10_sgnrel_up_1e1_1p0_0p0_1p0_grpo_42_rule

Qwen3-1.7B-Base_csum_3_10_tok_array_1p0_0p0_1p0_grpo_42_rule

Qwen3-1.7B-Base_csum_3_10_tok_multiplication_1p0_0p0_1p0_grpo_42_rule

Qwen3-1.7B-Base_csum_3_10_sgnrel_down_1e0_1p0_0p0_1p0_grpo_42_rule