Models

39,973
shanchenWarm8B32K

s1.1-limo-multilingual-4

0
·
2
CriteriaPOWarm3B32K

llama3.2-3b-dpo-finegrained

0
·
2
·
May 2025
ross-rlWarm33B32K

qwen2.5-coder-32b-instruct-sft-warmup-adapter-id-sft2

0
·
2
yjwonWarm9B16K

mpg27_gemma9b_sft

0
·
2
MergeBench-gemma-2-9b-itWarm9B16K

gemma-2-9b-it_aya_2epoch

0
·
2
LansechenWarm8B32K

Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0511-v3

0
·
2
iamsahinemirWarm8B8K

meta-llama

0
·
2
kamelcharafWarm3B32K

GRPO-SFT-qwen2.5-3B-qwen2.5-7B-mrd3-s7-sum_token_prompt-merged

0
·
2
winglianWarm14B32K

qwen3-14b-triton-v1

0
·
2
kamelcharafWarm3B32K

GRPO-qwen2.5-3B-qwen2.5-7B-mrd3-s7-sum_token_prompt-merged

0
·
2
MegaSWWarm3B32K

verl_sft

0
·
2
mlfoundations-devWarm8B32K

ot3_300k_ckpt-epoch4

0
·
2
Yihong7788Warm8B32K

qwen2.5-2wiki-kg-sft-300

0
·
2
shanchenWarm8B32K

ds-limo-fr-250

0
·
2
MergeBench-gemma-2-9bWarm9B16K

gemma-2-9b_wildguard_jailbreak_2epoch

0
·
2
Yuuta208Warm8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-slerp-29

0
·
2
yjwonWarm9B16K

mp_gemma9b_sft

0
·
2
d1shs0apWarm2B32K

easy-8k-med16k

0
·
2
sparkle-reasoningWarm8B32K

SparkleRL-7B-Stage2-hard

0
·
2
shanchenWarm8B32K

ds-limo-te-100

0
·
2
akbarsigitWarm8B32K

llama3.1-sft-r256-a512-merged-16bit

0
·
2
MinaMilaWarm3B8K

gemma2_2b_unlearned

0
·
2
dulguun222Warm3B32K

qwen_3b_math

0
·
2
alvinmingWarm8B32K

es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step320

0
·
2
alvinmingWarm8B32K

es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step720

0
·
2
zztheavenWarm8B32K

Llama-3.1-8B-Instruct-Open-R1-GRPO

0
·
2
secmlrWarm8B32K

DS-Noisy_DS-Clean_DS-OSS_QWQ-OSS_QWQ-Clean_QWQ-Noisy_Con_Qwen2.5-7B-Instruct_sft

0
·
2
shanchenWarm8B32K

ds-limo-ja-100

0
·
2
GiuLeo01Warm3B32K

FortranCodeGen-3B-SynthData-onlysft

0
·
2
SinaElahimaneshWarm27B32K

Gemma-2-27b-IT-Therapy-Farsi-VLLM

0
·
2
LansechenWarm8B32K

Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v2

0
·
2
RefinedNeuroWarm8B8K

RN_TR_R1

0
·
2
AlphataoWarm8B32K

Affine-7470548

0
·
2
ZeroAgencyWarm24B32K

Mistral-Small-3.1-24B-Instruct-2503-hf

2
·
2
zwhe99Warm3B32K

Qwen2.5-3B-orz

0
·
2
Marco0Warm3B32K

gronger

0
·
2
joanna302Warm8B32K

Qwen3-8B-Base_fr_pt_zh_ar_2e-05_seed43

0
·
2
7DragonsWarm3B32K

Spider_2

0
·
2
morzzzWarm3B32K

one9

0
·
2
morzzzWarm3B32K

one3

0
·
2
memevisWarm3B32K

hug10

0
·
2
sam2aiWarm8B32K

llama_3.1_8b_r_1

0
·
2