Models

36,593
2B · 32K · qwen2-1b5
Warm

anujjamwal/OpenMath-Nemotron-1.5B-PruneAware

0 · 42 · Mar 2026
500M · 32K · qwen2-0b5
Warm

Bilmokhtar23/chess-qwen2.5-0.5b-v2

0 · 42 · Mar 2026
800M · 32K · qwen3-0b6
Warm

LorenaYannnnn/general_reward-Qwen3-0.6B-baseline_cot_only-seed_2

0 · 42 · Mar 2026
800M · 32K · qwen3-0b6
Warm

LorenaYannnnn/general_reward-Qwen3-0.6B-baseline_cot_only-seed_0

0 · 42 · Mar 2026
800M · 32K · qwen3-0b6
Warm

luisfsalazar/Qwen3-0.6B-Base-CPT-Math

0 · 42 · Mar 2026
2B · 32K · qwen2-1b5
Warm

Kimyayd/Qwen-1.5B-Fongbe-Translator

0 · 42 · Mar 2026
1B · 32K · gemma3t-1b
Warm

kth8/gemma-3-1b-it-SuperGPQA-Classifier

0 · 42 · Mar 2026
1B · 32K · llama32-1b
Warm

Yaseal/llama3_1b_instruct_vallina_full_sft_30k

0 · 42 · Mar 2026
4B · 32K · qwen3-4b
Warm

Hyeongwon/P9-split2_prob_Qwen3-4B-Base_0322-01

0 · 42 · Mar 2026
500M · 32K · qwen2-0b5
Warm

excepto64/Qwen2.5-0.5B-Instruct_incorrect-medical-advice-realigned-correct-financial-advice

0 · 42 · Mar 2026
2B · 32K · qwen2-1b5
Warm

zamber1991/Qwen2.5-1.5B-KTO-Finetuning

0 · 42 · Mar 2026
3B · 32K · qwen25-3b
Warm

iq28/Qwen2.5-3B-Instruct

0 · 42 · Mar 2026
800M · 32K · qwen3-0b6
Warm

puddledark/Qwen3-0.6B

0 · 42 · Mar 2026
2B · 32K · qwen3-1b7
Warm

Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_1p0_0p2_1p0_grpo_dr_grpo_42_rule

0 · 42 · Mar 2026
2B · 32K · qwen3-1b7
Warm

Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_1p0_0p5_1p0_grpo_dr_grpo_42_rule

0 · 42 · Mar 2026
1B · 32K · llama32-1b
Warm

Neelectric/Llama-3.2-1B-Instruct_SDFT_sciencev00.01

0 · 42 · Mar 2026
500M · 32K · qwen2-0b5
Warm

chenaaas/Qwen2.5-0.5B-Instruct

0 · 42 · Mar 2026
4B · 32K · qwen3-4b
Warm

RuleReasoner/RuleReasoner-4B

1 · 42 · Jun 2025
3B · 32K · llama32-3b
Warm

j05hr3d/Llama-3.2-3B-Instruct-C_M_T-AUX_CT_CE

0 · 42 · Mar 2026
4B · 32K · qwen3-4b
Warm

simonycl/Qwen3-4B-Instruct-2507-InverseIFEval-DPO

0 · 42 · Mar 2026