Models

2,770
minhtuan7akpWarmTools500M32K

qwen2.5_0.5b_base_scratch_reasoning_finetune

0
·
8
SVECTOR-CORPORATIONWarmTools3B32K

Theta-35-Mini

10
·
8
·
Apr 2025
jxoptionalWarmTools15B32K

xori-1-14b

1
·
8
·
Mar 2026
mlfoundations-devWarmTools8B32K

mlfoundations-dev_code-stratos-verified-scaled-1_stratos_7b

0
·
7
mlfoundations-devWarmTools8B32K

llama3-1_8b_4o_annotated_math

0
·
7
legmlaiWarmTools15B32K

legml-v0.1

14
·
7
·
Nov 2024
kamelcharafWarmTools3B32K

GRPO-SFT-qwen2.5-3B-qwen2.5-7B-mrd3-s7-sum_token_prompt-merged

0
·
7
razor534WarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-stocky_nasty_pheasant

0
·
7
·
Jun 2025
Danau5tinWarmTools3B32K

calculator_agent_qwen2.5_3b

3
·
7
predibaseWarmTools33B32K

Predibase-T2T-32B-RFT

20
·
7
·
Mar 2025
PeterJinGoWarmTools3B32K

SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo-v0.2

0
·
7
Usman391WarmTools3B32K

qwen-3B-stego-2-codes

0
·
7
·
Jan 2026
Usman391WarmTools3B32K

qwen-3B-stego-no-codes

0
·
7
·
Jan 2026
reds0510WarmTools3B32K

qwq_mixed_evol8k_aug4k_1e5

0
·
7
·
Jan 2026
DXCLabWarmTools3B32K

OncoCareBrain-GPT

2
·
7
·
Mar 2025
Mojo7WarmTools3B32K

Katkut-3B

1
·
7
·
Feb 2026
PeterJinGoWarmTools3B32K

SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-grpo-v0.3

0
·
7
·
May 2025
archiiiiWarmTools3B32K

medical-qwen-315

0
·
7
·
Mar 2026
ogulcanaydoganWarmTools33B32K

Turkish-LLM-32B-Instruct

1
·
7
·
Mar 2026
mlfoundations-devWarmTools33B32K

DCFT-Stratos-Unverified-114k-32B

0
·
6
mlfoundations-devWarmTools8B32K

stratos-unverified-mix-scaled-1

0
·
6
minhtuan7akpWarmTools500M32K

qwen2.5_0.5b_base_qa_finetune_v3

0
·
6
dulguun222WarmTools3B32K

qwen_3b_math

0
·
6
p2g3ads4WarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-camouflaged_tame_alpaca

0
·
6
cryptobrosWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-endangered_burrowing_sealion

0
·
6
Silin1590WarmTools8B32K

Qwen-7B-Int-CoT

0
·
6
yrshiWarmTools3B32K

AutoRefine-Qwen2.5-3B-Instruct

1
·
6
linxyWarmTools15B32K

RETuning-DeepSeek_R1_14B_SFT_GRPO

1
·
6
philipperen55WarmTools15B32K

Qwen2.5-14B-style-MERGED-v3-BF16

0
·
6
·
Dec 2025
webbigdataWarmTools3B32K

FanFic-Illustrator

14
·
6
·
Mar 2025
yurunyyrWarmTools3B32K

agentic-futoshiki-NoStateTrans_qwen2.5-3B-5e-6_gt-SFT_20k

0
·
6
·
Jan 2026
ray0rf1reWarmTools3B32K

Nix2.5-plus

1
·
6
·
Jan 2026
PhonsiriWarmTools3B32K

Qwen2.5-3B-Math-Distilled

0
·
6
·
Feb 2026
yzxjbWarmTools3B32K

RL-PW0.6-Qwen2.5-Decision-step20

0
·
6
·
Mar 2026
long-horizon-reasoningWarmTools3B32K

Qwen-3b-GRPO-len-5

0
·
6
·
Sep 2025
LegendaryDawnWarmTools3B32K

SDRL-icml_rebuttal-freq-Qwen2.5-3B-majority_n8_l2048-DAPO_n8_bs256_long8-step200

0
·
6
·
Mar 2026
mlfoundations-devWarmTools8B32K

DCFT-Stratos-Verified-114k-7B-4gpus

1
·
5
mlfoundations-devWarmTools8B32K

oh-dcft-v3.1-claude-3-5-sonnet-20241022-qwen

0
·
5
mlfoundations-devWarmTools8B32K

llama3-1_8b_4o_annotated_aops

0
·
5
mlfoundations-devWarmTools8B32K

s1K_reformat

0
·
5
mlfoundations-devWarmTools8B32K

difficulty_sorting_easy_seed_math

0
·
5
·
Feb 2025
mlfoundations-devWarmTools8B32K

stratos_verified_plus_s1r1

0
·
5