Models

5,769
mimoidochiWarmTools2B32K

OpenRS-GRPO

0
·
2
·
Mar 2026
mimoidochiWarmTools2B32K

OpenRS-GRPO-S

0
·
2
·
Mar 2026
Donfab31WarmTools800M32K

Qwen3-0.6B-Base-CPT-Math

0
·
2
·
Mar 2026
HyeongwonWarmTools4B32K

P9-split1_prob_Qwen3-4B-Base_0317-01

0
·
2
·
Mar 2026
HyeongwonWarmTools4B32K

P2-split2_bs256_prob_Qwen3-4B-Base_0317-01

0
·
2
·
Mar 2026
AbbottYangWarmTools500M32K

Qwen2-0.5B-GRPO-test

0
·
2
·
Mar 2026
Ilia2003MahWarmTools2B32K

qwen2.5-1.5b-gsm8k-test-step500

0
·
2
·
Mar 2026
Ilia2003MahWarmTools2B32K

qwen2.5-1.5b-gsm8k-test-step1000

0
·
2
·
Mar 2026
HyeongwonWarmTools4B32K

P2-split2_bs512_epoch5_5e-5_prob_Qwen3-4B-Base_0320-01

0
·
2
·
Mar 2026
Kazuki1450WarmTools2B32K

Qwen3-1.7B-Base_dsum_3_6_1p0_0p5_1p0_grpo_42_rule

0
·
2
·
Mar 2026
Kazuki1450WarmTools2B32K

Qwen3-1.7B-Base_dsum_3_6_tok_python_alt_1_per_2_1p0_0p0_1p0_grpo_42_rule

0
·
2
·
Mar 2026
Kazuki1450WarmTools2B32K

Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_alt_1_per_2_1p0_0p0_1p0_grpo_42_rule

0
·
2
·
Mar 2026
HyeongwonWarmTools4B32K

P9-split3_prob_Qwen3-4B-Base_0322-01

0
·
2
·
Mar 2026
codesapoorvWarmTools4B32K

bed-recovery-merged-qwen3-4B-config4-v2

0
·
2
·
Feb 2026
Kazuki1450WarmTools2B32K

Qwen3-1.7B-Base_dsum_3_6_1p0_0p0_1p0_grpo_sapo_42_rule

0
·
2
·
Mar 2026
Ilia2003MahWarmTools2B32K

qwen2.5-1.5b-gsm8k-train-step0

0
·
2
·
Mar 2026
vallerieeWarmTools2B32K

Qwen3-1.7B-teacher-refusal-badnet

0
·
2
·
Mar 2026
Anonymous-2004WarmTools2B32K

asgn2-model_harmful_lora

0
·
2
·
Mar 2026
Ilia2003MahWarmTools2B32K

qwen2.5-1.5b-gsm8k-train-step2000

0
·
2
·
Mar 2026
Ilia2003MahWarmTools2B32K

qwen2.5-1.5b-gsm8k-train-step2500

0
·
2
·
Mar 2026
Ilia2003MahWarmTools2B32K

qwen2.5-1.5b-gsm8k-train-step3500

0
·
2
·
Mar 2026
Ilia2003MahWarmTools2B32K

qwen2.5-1.5b-gsm8k-train-step4000

0
·
2
·
Mar 2026
Ilia2003MahWarmTools2B32K

qwen2.5-1.5b-gsm8k-train-step7000

0
·
2
·
Mar 2026
Ilia2003MahWarmTools2B32K

qwen2.5-1.5b-gsm8k-train-step8000

0
·
2
·
Mar 2026
PetarKalWarmTools4B32K

Qwen3-4B-Base-ascii-art-v5-e3-lr5e-5-ga16-ctx4096

0
·
2
·
Mar 2026
j05hr3dWarmTools1B32K

Llama-3.2-1B-Instruct-2EP-C_M_T

0
·
2
·
Mar 2026
simonyclWarmTools4B32K

Qwen3-4B-Instruct-2507-InverseIFEval-DPO

0
·
2
·
Mar 2026
smkang79WarmTools2B32K

Qwen3-1.7B-base-MED

0
·
2
·
Mar 2026
eunhyangWarmTools2B32K

Qwen3-1.7B-base-MED

0
·
2
·
Mar 2026
totem205WarmTools2B32K

Qwen3-1.7B-base-MED

0
·
2
·
Mar 2026
Kazuki1450WarmTools2B32K

Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_42_rule

0
·
2
·
Mar 2026
swadeshbWarmTools3B32K

Llama-3.2-3B-Instruct-MPO-SKD-V7

0
·
2
·
Mar 2026
BRlklWarmTools4B32K

distill-sft-grpo-4_70-full

0
·
2
·
Mar 2026
j05hr3dWarmTools3B32K

Llama-3.2-3B-Instruct-C_M_T-SAM_RHO0_02

0
·
2
·
Mar 2026
j05hr3dWarmTools3B32K

Llama-3.2-3B-Instruct-C_M_T-SAM_RHO0_02-AUX_CT_CE

0
·
2
·
Mar 2026
j05hr3dWarmTools1B32K

Llama-3.2-1B-Instruct-C_M_T-1EP

0
·
2
·
Mar 2026
MVPRMWarmTools800M32K

Qwen3-0.6B-Base-CPT-Math

0
·
2
·
Mar 2026
gunjanhugWarm3B2K

phi2-text-to-sql-full-20k

0
·
2
·
Mar 2026
odatsWarm1B32K

wmt_all

0
·
2
·
Apr 2026
tom20250414WarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-endangered_aquatic_starfish

0
·
2
·
Apr 2025
JayHyeonWarmTools500M32K

Qwen2.5-0.5B-SFT-2e-4-5ep

0
·
2
·
Dec 2024
sdfsdsssFJosyWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-swift_tough_seal

0
·
2
·
Apr 2025