Models

4,324
4B · 32K · qwen3-4b · Warm · nntoan209/Affine_maLoT · 0 · 1 · Nov 2025
3B · 8K · gemma2-2b · Warm · alykassem/gemma-2-2b-it-risky_financial_advice · 0 · 1 · Dec 2025
800M · 32K · qwen3-0b6 · Warm · nandansarkar/qwen3_0-6B_adversarial_6 · 0 · 1
2B · 32K · qwen3-1b7 · Warm · HuggingFaceAlbert/Qwen3-1.7B-grpo-1765505298 · 0 · 1
4B · 32K · qwen3-4b · Warm · hmdmahdavi/s1-generator-critique-Qwen3-4B-Instruct-2507-20251214_200751 · 0 · 1
2B · 32K · qwen2-1b5 · Warm · ahme0599/Qwen_Qwen2.5-1.5B-Instruct-GRPO-vanilla_G_4 · 0 · 1 · Dec 2025
4B · 32K · qwen3-4b · Warm · yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B-Base · 0 · 1
8B · 32K · qwen2-7b · Warm · ccui46/q2.5_7b_aime_per_chunk_act_untrained_1000 · 0 · 1 · Dec 2025
2B · 32K · qwen2-1b5 · Warm · asparius/Qwen2.5-1.5B-SPO-1ep-iter2 · 0 · 1 · Dec 2025
4B · 32K · qwen3-4b · Warm · hmdmahdavi/olympiad-curated-qwen3-4b-thinking-generator-critique · 0 · 1 · Jan 2026
1B · 32K · llama32-1b · Warm · gshasiri/SmolLM3-SFT · 0 · 1 · Nov 2025
3B · 32K · qwen25-3b · Warm · gradients-io-tournaments/tournament-tourn_5b58cbbb12b8c212_20260130-2c0c4a91-4bed-4e5d-ab09-f04d17659b03-5Dt9U4c1 · 0 · 1 · Jan 2026
3B · 32K · qwen25-3b · Warm · gradients-io-tournaments/tournament-tourn_5b58cbbb12b8c212_20260130-2c0c4a91-4bed-4e5d-ab09-f04d17659b03-5Ca32LwM · 0 · 1 · Jan 2026
3B · 32K · qwen25-3b · Warm · gradients-io-tournaments/tournament-tourn_5b58cbbb12b8c212_20260130-2c0c4a91-4bed-4e5d-ab09-f04d17659b03-5C7vE26G · 0 · 1 · Jan 2026
8B · 32K · llama31-8b · Warm · DongfuJiang/prm_version2_subsample_hf · 0 · 0
8B · 32K · llama31-8b · Warm · mlfoundations-dev/oh_v1_w_v3_metamath · 0 · 0
8B · 32K · llama31-8b · Warm · mlfoundations-dev/OH_DCFT_V3_wo_dataforge_economics · 0 · 0
8B · 32K · llama31-8b · Warm · mlfoundations-dev/OH_original_wo_slimorca_550k · 0 · 0
8B · 32K · llama31-8b · Warm · mlfoundations-dev/oh_v1-2_only_slim_orca · 0 · 0
8B · 32K · llama31-8b · Warm · mlfoundations-dev/oh_v1-2_only_evol_instruct · 0 · 0