Models

37,234
4B · 32K · qwen3-4b · Warm
Lambent/Qwen3-4B-Base-Continued-GRPO-Style-Karcher
1 · 36 · Feb 2026

4B · 32K · qwen3-4b · Warm
matonski/self-preservation-KREL-Qwen3-4B
1 · 36 · Mar 2026

4B · 32K · qwen3-4b · Warm
Sangsang/ContextRLDEMO-Qwen3-4B-Instruct-2048-ep3
0 · 36 · Mar 2026

4B · 32K · qwen3-4b · Warm
LauraRuis/llmscience
0 · 36 · Mar 2026

500M · 32K · qwen2-0b5 · Warm
tommymir4444/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-flexible_thriving_lobster
0 · 36 · Dec 2025

500M · 32K · qwen2-0b5 · Warm
notnoll/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-deft_fierce_mongoose
0 · 36 · Nov 2025

800M · 32K · qwen3-0b6 · Warm
KipWill7/Qwen3-0.6B-Gensyn-Swarm-tropical_rugged_impala
0 · 36 · Nov 2025

2B · 32K · qwen2-1b5 · Warm
somendrew/genz-qwen-2.5-1.5B
0 · 36 · Mar 2026

800M · 32K · qwen3-0b6 · Warm
LorenaYannnnn/general_reward-Qwen3-0.6B-baseline_cot_only-seed_0
0 · 36 · Mar 2026

800M · 32K · qwen3-0b6 · Warm
luisfsalazar/Qwen3-0.6B-Base-CPT-Math
0 · 36 · Mar 2026

800M · 32K · qwen3-0b6 · Warm
andre-garcia/Qwen3-0.6B-Base-CPT-Math
0 · 36 · Mar 2026

2B · 32K · qwen3-1b7 · Warm
OpenHands/CodeScout-1.7B-RFT
1 · 36 · Mar 2026

800M · 32K · qwen3-0b6 · Warm
LorenaYannnnn/unsafe_compliance-Qwen3-0.6B-OURS_self-seed_0
0 · 36 · Mar 2026

800M · 32K · qwen3-0b6 · Warm
LorenaYannnnn/general_reward-Qwen3-0.6B-OURS_self-seed_0
0 · 36 · Mar 2026

2B · 32K · qwen3-1b7 · Warm
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_alt_1_per_2_1p0_0p0_1p0_grpo_42_rule
0 · 36 · Mar 2026

500M · 32K · qwen2-0b5 · Warm
excepto64/Qwen2.5-0.5B-Instruct_incorrect-medical-advice
0 · 36 · Mar 2026

4B · 32K · qwen3-4b · Warm
ljcamargo/Akkadian-Finetune-Qwen3-4B-Merged-16B
0 · 36 · Mar 2026

4B · 32K · qwen3-4b · Warm
longdev37/qwen3-4b-hospital-tth-merged
0 · 36 · Mar 2026

4B · 32K · qwen3-4b · Warm
ljcamargo/Akkadian-2-Finetune-Qwen3-4B-Merged-16B-NEW
0 · 36 · Mar 2026

2B · 32K · qwen2-1b5 · Warm
Ilia2003Mah/qwen2.5-1.5b-gsm8k-train-step3500
0 · 36 · Mar 2026