Models

5,522
2B · 32K · qwen3-1b7
Cold

MultiRL/qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch2

0
·
89
·
Mar 2026
8B · 32K · qwen3-8b
Cold

minchaoh2002/PK-Link-Qwen3-8B-RSA-SFT-GRPO-self-judge-0.02-kl-4e-6_step_20

0
·
89
·
Mar 2026
4B · 32K · qwen3-4b
Cold

Montalte/instruct_code_rl

0
·
89
·
Apr 2026
4B · 32K · qwen3-4b
Cold

Johnny1024/k10-lr5e-7-ema0.01-qwen3-4b-think-essay_sensitive20pct-pos_gap20pct

0
·
89
·
Apr 2026
8B · 32K · qwen3-8b
Cold

ScienceOne-AI/S1-Base-8B

5
·
88
·
Jul 2025
14B · 32K · qwen3-14b
Cold

internlm/JanusCoder-14B

34
·
87
·
Oct 2025
2B · 32K · qwen3-1b7
Cold

Kazuki1450/Qwen3-1.7B-Base_csum_6_10_sgnrel_sym_1_1p0_0p0_1p0_grpo_42_rule

0
·
87
·
Mar 2026
14B · 32K · qwen3-14b
Cold

ajtakto/Qwen3SK

0
·
87
·
Mar 2026
800M · 32K · qwen3-0b6
Cold

Akchacha/Qwen3-0.6B-Gensyn-Swarm-untamed_clawed_elephant

0
·
87
·
Sep 2025
8B · 32K · qwen3-8b
Cold

beanie00/math-GRPO-Qwen3-8B-think-step-100

0
·
87
·
Mar 2026
8B · 32K · qwen3-8b
Cold

fifrio/Qwen3-8B-tacq-4bit-calibration-Indonesian-128samples

0
·
87
·
Dec 2025
4B · 32K · qwen3-4b
Cold

daisd-ai/ner-on-merged

0
·
87
·
Dec 2025
4B · 32K · qwen3-4b
Cold

duckknowsAI/Affine-Toancon-5Hg1K2prUdnvSnG7m3mZBmF9hyo8zu8Z4miJSYsfe9Hpvgcu

0
·
86
·
Jan 2026
4B · 32K · qwen3-4b
Cold

pokke11/dpo-qwen-cot-merged

0
·
86
·
Feb 2026
32B · 32K · qwen3-32b
Cold

volkerbarth/Affine-BW-5FZUTxGJvVknsLRqSuDzr8bFkK3gNn2tALbBgGDpQFR5uNET

0
·
86
·
Mar 2026
2B · 32K · qwen3-1b7
Cold

MultiRL/qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch3

0
·
86
·
Mar 2026
4B · 32K · qwen3-4b
Cold

surajkyc/qwen3-er-match_notmatch-merged

0
·
86
·
Mar 2026
14B · 32K · qwen3-14b
Cold

orlandowhite/Qwen3-14B-HTS-SFT

0
·
86
·
Apr 2026
4B · 32K · qwen3-4b
Cold

ertghiu256/Qwen3-4b-tcomanr-merge-v2.2

2
·
86
·
Aug 2025
4B · 32K · qwen3-4b
Cold

Johnny1024/TTRL-sciknoweval_physics-TTRL-Len-8k-grpo-014723

0
·
86
·
Apr 2026