Models

4,234
2B32Kqwen3-1b7
Warm

HuggingFaceAlbert/Qwen3-1.7B-grpo-1765505298

0
·
1
4B32Kqwen3-4b
Warm

hmdmahdavi/s1-generator-critique-Qwen3-4B-Instruct-2507-20251214_200751

0
·
1
2B32Kqwen2-1b5
Warm

ahme0599/Qwen_Qwen2.5-1.5B-Instruct-GRPO-vanilla_G_4

0
·
1
·
Dec 2025
4B32Kqwen3-4b
Warm

yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B-Base

0
·
1
2B32Kqwen2-1b5
Warm

asparius/Qwen2.5-1.5B-SPO-1ep-iter2

0
·
1
·
Dec 2025
4B32Kqwen3-4b
Warm

hmdmahdavi/olympiad-curated-qwen3-4b-thinking-generator-critique

0
·
1
·
Jan 2026
1B32Kllama32-1b
Warm

gshasiri/SmolLM3-SFT

0
·
1
·
Nov 2025
3B32Kqwen25-3b
Warm

gradients-io-tournaments/tournament-tourn_5b58cbbb12b8c212_20260130-2c0c4a91-4bed-4e5d-ab09-f04d17659b03-5Dt9U4c1

0
·
1
·
Jan 2026
3B32Kqwen25-3b
Warm

gradients-io-tournaments/tournament-tourn_5b58cbbb12b8c212_20260130-2c0c4a91-4bed-4e5d-ab09-f04d17659b03-5Ca32LwM

0
·
1
·
Jan 2026
3B32Kqwen25-3b
Warm

gradients-io-tournaments/tournament-tourn_5b58cbbb12b8c212_20260130-2c0c4a91-4bed-4e5d-ab09-f04d17659b03-5C7vE26G

0
·
1
·
Jan 2026
8B32Kllama31-8b
Warm

DongfuJiang/prm_version2_subsample_hf

0
·
0
8B32Kllama31-8b
Warm

mlfoundations-dev/oh_v1_w_v3_metamath

0
·
0
8B32Kllama31-8b
Warm

mlfoundations-dev/OH_DCFT_V3_wo_dataforge_economics

0
·
0
8B32Kllama31-8b
Warm

mlfoundations-dev/OH_original_wo_slimorca_550k

0
·
0
8B32Kllama31-8b
Warm

mlfoundations-dev/oh_v1-2_only_slim_orca

0
·
0
8B32Kllama31-8b
Warm

mlfoundations-dev/oh_v1-2_only_evol_instruct

0
·
0
8B32Kllama31-8b
Warm

mlfoundations-dev/oh_v3-1_only_dataforge_economics

0
·
0
8B32Kllama31-8b
Warm

mlfoundations-dev/oh_v3-1_only_glaive_code_assistant

0
·
0
8B32Kllama31-8b
Warm

mlfoundations-dev/hp_ablations_llama3_epoch1_dcftv1.2

0
·
0
8B32Kllama31-8b
Warm

mlfoundations-dev/stackoverflow_5000tasks_1p

0
·
0