Models

5,846
abcorrea · Warm · Tools · 4B · 32K

struct-v6

0
·
4
·
Jan 2026
thangvip · Warm · Tools · 2B · 32K

Qwen3-1.7B-SFT-math-1500

0
·
4
·
Jan 2026
Seeker38 · Warm · Tools · 3B · 32K

Llama3.2-3b-abc-notation-genshin-impact

0
·
4
·
Mar 2025
sychonix · Warm · Tools · 500M · 32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-foxy_squeaky_llama

1
·
4
·
Apr 2025
Blancy · Warm · Tools · 500M · 32K

DeepSeek-R1-Distill-Qwen-0.5B-GRPO

0
·
4
·
Apr 2025
abcorrea · Warm · Tools · 4B · 32K

struct-v3

0
·
4
·
Nov 2025
laion · Warm · Tools · 32B · 32K

GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epochs_4.0_Qwen3-32B

0
·
4
·
Jan 2026
souradeepmukhopadhyay99 · Warm · Tools · 4B · 32K

qwen3-4b-apigenmt-5k-trl-fullft

0
·
4
·
Jan 2026
rakshit-nalayak · Warm · Tools · 800M · 32K

qwen3-0.6b-chess

0
·
4
·
Jan 2026
rsinema · Warm · Tools · 500M · 32K

Qwen2.5-0.5B-Instruct-dm

0
·
4
·
Oct 2024
yuerxin · Warm · Tools · 2B · 32K

DeepSeek-R1-Distill-Qwen-1.5B

0
·
4
·
Sep 2025
thangvip · Warm · Tools · 2B · 32K

qwen3-1.7b-dspo-no-sft-sgd-linear

0
·
4
·
Feb 2026
abcorrea · Warm · Tools · 4B · 32K

sched-v2

0
·
4
·
Feb 2026
Asib1 · Warm · Tools · 500M · 32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_leggy_ant

0
·
4
·
Apr 2025
thangvip · Warm · Tools · 2B · 32K

qwen2.5-1.5b-grpo-no-sft-sgd-linear

0
·
4
·
Feb 2026
Lunzima · Warm · Tools · 15B · 32K

NQLSG-Qwen2.5-14B-MegaFusion-v5-roleplay

1
·
4
·
Feb 2025
Quaxicron · Warm · Tools · 500M · 32K

test2

0
·
4
·
Feb 2026
CoconutEmb · Warm · Tools · 2B · 32K

SFT-Qwen2.5-1.5B-Instruct-TongSearch

0
·
4
·
Feb 2026
BRlkl · Warm · Tools · 4B · 32K

orchestrator-qwen3-4b-full

0
·
4
·
Feb 2026
sampluralis · Warm · Tools · 1B · 32K

llama-mid-qkvo

0
·
4
·
Feb 2026
qrk-labs · Warm · Tools · 4B · 32K

akeel-cot-qwen3-4B-3k-v2b

0
·
4
·
Mar 2026
canbingol · Warm · 1B · 32K

gemma3_1B_base-tr-cpt-3epoch_15k_data

0
·
4
·
Mar 2026
weizhepei · Warm · Tools · 3B · 32K

Qwen2.5-3B-WebArena-Lite-SFT-CoT-QwQ-32B-epoch-10

0
·
4
·
Apr 2025
canbingol · Warm · 1B · 32K

gemma3_1B_base-tr-cpt-1epoch_stage2

0
·
4
·
Mar 2026
canbingol · Warm · 1B · 32K

gemma3_1B_base-tr-cpt-1epoch_stage3

0
·
4
·
Mar 2026
nimabod · Warm · Tools · 500M · 32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-soaring_sprightly_antelope

0
·
4
·
Apr 2025
Hyeongwon · Warm · Tools · 4B · 32K

P2_prob_Qwen3-4B-Base_0311-01

0
·
4
·
Mar 2026
BredForCompanionship · Warm · Tools · 800M · 32K

qwen3-0.6b-warmup

0
·
4
·
Mar 2026
j05hr3d · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-C_M_T_CT_CE_CM

0
·
4
·
Mar 2026
rediska0123 · Warm · Tools · 2B · 32K

qwen2.5-math-1.5b-dpo-gsm8k-v3

0
·
4
·
Mar 2026
mimoidochi · Warm · Tools · 2B · 32K

OpenRS-GRPO-S-2

0
·
4
·
Mar 2026
osieosie · Warm · Tools · 4B · 32K

tmax-qwen3-4b-sft-20260316-100k-asst-loss

0
·
4
·
Mar 2026
mimoidochi · Warm · Tools · 2B · 32K

OpenRS-GRPO-1

0
·
4
·
Mar 2026
Hyeongwon · Warm · Tools · 4B · 32K

P2-split2_prob_Qwen3-4B-Base_0317-01

0
·
4
·
Mar 2026
Neelectric · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_SFT_sciencev00.04

0
·
4
·
Mar 2026
Kazuki1450 · Warm · Tools · 2B · 32K

Qwen3-1.7B-Base_dsum_3_6_tok_python_alt_1_per_2_1p0_0p0_1p0_grpo_42_rule

0
·
4
·
Mar 2026
Kazuki1450 · Warm · Tools · 2B · 32K

Qwen3-1.7B-Base_dsum_3_6_tok_python_alt_1_per_10_1p0_0p0_1p0_grpo_42_rule

0
·
4
·
Mar 2026
Kazuki1450 · Warm · Tools · 2B · 32K

Qwen3-1.7B-Base_dsum_3_6_tok_python_alt_1_per_5_1p0_0p0_1p0_grpo_42_rule

0
·
4
·
Mar 2026
Kazuki1450 · Warm · Tools · 2B · 32K

Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_alt_1_per_10_1p0_0p0_1p0_grpo_42_rule

0
·
4
·
Mar 2026
Hyeongwon · Warm · Tools · 4B · 32K

P9-split2_prob_Qwen3-4B-Base_0322-01

0
·
4
·
Mar 2026
j05hr3d · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-C_M_T_CT-Limited

0
·
4
·
Mar 2026
Kazuki1450 · Warm · Tools · 2B · 32K

Qwen3-1.7B-Base_dsum_3_6_1p0_0p0_1p0_grpo_dr_grpo_42_rule

0
·
4
·
Mar 2026