Models

2,769
DeepRetrieval / DeepRetrieval-NQ-BM25-3B · Warm · Tools · 3B · 32K · 3 · 14 · May 2025
GreatGoose / Qwen2.5-3B-Instruct-full-loglm · Warm · Tools · 3B · 32K · 0 · 14 · Jan 2026
gjyotin305 / Qwen2.5-3B-Instruct_old_sft_alpaca_003 · Warm · Tools · 3B · 32K · 0 · 14 · Jan 2026
EvoNet / EvoNet-3B-V2 · Warm · Tools · 3B · 32K · 0 · 14 · Feb 2026
TStark12310 / arbor-treesearch-3b · Warm · Tools · 3B · 32K · 0 · 14 · Mar 2026
gjyotin305 / Qwen2.5-3B-Instruct_adaptive_tune_no_ref · Warm · Tools · 3B · 32K · 0 · 14 · Mar 2026
xw1234gan / Main_fixed_MATH_3B_step_9 · Warm · Tools · 3B · 32K · 0 · 14 · Mar 2026
xw1234gan / Main_fixed_MATH_3B_step_10 · Warm · Tools · 3B · 32K · 0 · 14 · Mar 2026
xw1234gan / Main_MATH_3B_step_1 · Warm · Tools · 3B · 32K · 0 · 14 · Mar 2026
xw1234gan / Main_MATH_3B_step_2 · Warm · Tools · 3B · 32K · 0 · 14 · Mar 2026
xw1234gan / Main_MATH_3B_step_6 · Warm · Tools · 3B · 32K · 0 · 14 · Mar 2026
nbeerbower / Dumpling-Qwen2.5-32B · Warm · Tools · 33B · 32K · 11 · 13 · Jan 2025
sravanthib / Qwen-2.5-7B-Simple-RL · Warm · Tools · 8B · 32K · 0 · 13
MrezaPRZ / Qwen2.5-Coder-7B-Instruct-SQL-COT · Warm · Tools · 8B · 32K · 0 · 13
MrezaPRZ / Qwen2.5-Coder-14B-Instruct-SQL · Warm · Tools · 15B · 32K · 0 · 13
chenggong1995 / Qwen-2.5-Math-7B-Max-v3-accuracy · Warm · Tools · 8B · 32K · 0 · 13
danieldk / Qwen2.5-1.5B-Instruct-w8a8-int-dynamic-weight · Warm · Tools · 2B · 32K · 0 · 13
ma921 / qwen-2.5-sft-golden-hh · Warm · Tools · 2B · 32K · 0 · 13
ZMC2019 / Qwen1.5B-L28-90K · Warm · Tools · 2B · 32K · 0 · 13
TheGardener / Qwen2.5-0.5B-finetune-wikitext · Warm · Tools · 500M · 32K · 0 · 13
Falln87 / Coder2.5-32b · Warm · Tools · 33B · 32K · 1 · 13
AmberYifan / Qwen2.5-7B-Instruct-userfeedback-SPIN-iter2 · Warm · Tools · 8B · 32K · 1 · 13
Lansechen / Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW · Warm · Tools · 3B · 32K · 0 · 13
lucke­ciano / Qwen-2.5-7B-GRPO-NoKL-1e-05-24 · Warm · Tools · 8B · 32K · 0 · 13
Lansechen / Qwen2.5-3B-Open-R1-GRPO-math-selected-default · Warm · Tools · 3B · 32K · 0 · 13
ross-rl / qwen2.5-coder-32b-instruct-sft-warmup-adapter-id-sft2 · Warm · Tools · 33B · 32K · 0 · 13
Yihong7788 / qwen2.5-2wiki-kg-sft-300 · Warm · Tools · 8B · 32K · 0 · 13
Yuuta208 / Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-della-27 · Warm · Tools · 8B · 32K · 0 · 13
mothnaZl / long-sr-Qwen2.5-7B-Instruct · Warm · Tools · 8B · 32K · 0 · 13
Lansechen / Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v2 · Warm · Tools · 8B · 32K · 0 · 13
cdreetz / kwen2.5-1.5b · Warm · Tools · 2B · 32K · 0 · 13
0xfader / Qwen2.5-0.5B-Instruct-Gensyn-Swarm-scampering_scavenging_tapir · Warm · Tools · 500M · 32K · 0 · 13
yusufbaykaloglu / Qwen2.5-3B-Turkish-SFT · Warm · Tools · 3B · 32K · 1 · 13
stellia / stellialm_mini_qwen_9tasks · Warm · Tools · 3B · 32K · 0 · 13
barandinho / qwen-2.5-32b-turkish-reasoning-consistency-rl · Warm · Tools · 33B · 32K · 0 · 13
akshayballal / Qwen2.5-3B-Instruct-Pubmed-16bit-GRPO · Warm · Tools · 3B · 32K · 0 · 13 · Jan 2026
gjyotin305 / Qwen2.5-3B-Instruct_new_alpaca_007 · Warm · Tools · 3B · 32K · 0 · 13 · Jan 2026
UWNSL / Qwen2.5-3B-Instruct_Mix-Large · Warm · Tools · 3B · 32K · 0 · 13 · Feb 2025
EvoNet / EvoNet-3B-V1 · Warm · Tools · 3B · 32K · 0 · 13 · Feb 2026
shawntzx / Qwen2.5-3B-GRPO-3_13_math · Warm · Tools · 3B · 32K · 0 · 13 · Mar 2025
sxsaa / Qwen2.5-3B-Math-Verifier-FullData-v2.0 · Warm · Tools · 3B · 32K · 0 · 13 · Feb 2026
Adanato / qwen25_3b_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_4 · Warm · Tools · 3B · 32K · 0 · 13 · Feb 2026