Models

14,989
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SFT-SPIN-iter1 · Warm · 8B · 32K · 1 · 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SPIN-iter2 · Warm · 8B · 32K · 1 · 1
LsTam/stellialm_smallfr_qwen7b_9tplus · Warm · 8B · 32K · 0 · 1 · Dec 2024
mlfoundations-dev/openthoughts3_10k · Warm · 8B · 32K · 0 · 1
mlfoundations-dev/openthoughts3_100k · Warm · 8B · 32K · 0 · 1
yununuy/guesswho-scale-base · Warm · 8B · 32K · 0 · 1
sorgfresser/testtrainsft · Warm · 8B · 32K · 0 · 1
ZMC2019/OpenR1-Qwen-7B-nsa-B1024-hwfalse · Warm · 8B · 32K · 0 · 1
neural-coder/finetuned-5 · Warm · 8B · 32K · 0 · 1
mlfoundations-dev/openthoughts3_100k_llama3 · Warm · 8B · 32K · 0 · 1
shanchen/ds-limo-te-50 · Warm · 8B · 32K · 0 · 1
shanchen/ds-limo-th-50 · Warm · 8B · 32K · 0 · 1
mlfoundations-dev/openthoughts3_30k_llama3 · Warm · 8B · 32K · 1 · 1
mombip/Meta-Llama-3.1-8B-Instruct · Warm · 8B · 32K · 0 · 1
Yuuta208/Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-dare_ties-27 · Warm · 8B · 32K · 0 · 1
MergeBench-gemma-2-9b-it/gemma-2-9b-it_Magicoder-Evol-Instruct-110K_2epoch · Warm · 9B · 16K · 0 · 1
shanchen/ds-limo-ja-50 · Warm · 8B · 32K · 0 · 1
mlfoundations-dev/openthoughts3_1k_llama3 · Warm · 8B · 32K · 0 · 1
kamelcharaf/GRPO-meta-3.1-8B-meta-3.1-8B-mrd3-s7-sum_token_prompt-merged · Warm · 8B · 32K · 0 · 1
inpars-plus/Meta-Llama-3.1-Instruct-8B_merged-16bit_CPO_MSMARCO · Warm · 8B · 32K · 0 · 1
neural-coder/xlam-finetuned · Warm · 8B · 32K · 1 · 1
ferdinandjasong/SuperCoder-7B-Qwen2.5-peft-merged · Warm · 8B · 32K · 0 · 1
Yuuta208/Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Instruct-Merged-ties-29 · Warm · 8B · 32K · 0 · 1
hendrydong/qwen-math-7b-raftpp-step120 · Warm · 8B · 32K · 0 · 1
izzcw/large_cooking_sft_success · Warm · 8B · 8K · 1 · 1
shanchen/s1.1-limo-multilingual-4 · Warm · 8B · 32K · 0 · 1
mlfoundations-dev/nemo_nano_300k · Warm · 8B · 32K · 0 · 1
shariar076/Llama-3.1-8B-Instruct-DPO-0R100L-PoliTune · Warm · 8B · 8K · 0 · 1
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0511-v3 · Warm · 8B · 32K · 0 · 1
MinaMila/llama_8b_unlearned_unbalanced_gender_1e-6_1.0_0.25_0.5_epoch3 · Warm · 8B · 32K · 0 · 1
MergeBench-Llama-8B-it/llama-3.1-8b-it_aya_2epoch · Warm · 8B · 32K · 0 · 1
joonleesky/qwen_chess1_3of5 · Warm · 8B · 32K · 0 · 1
MergeBench-gemma-2-9b-it/gemma-2-9b-it-GRPO-after-sft · Warm · 9B · 16K · 0 · 1
jlpang888/Llama-3-Base-8B-SFT-SimPO · Warm · 8B · 8K · 0 · 1
shanchen/ds-limo-fr-100 · Warm · 8B · 32K · 0 · 1
CompassioninMachineLearning/alpacallama_plus1k_80_20mix · Warm · 8B · 32K · 0 · 1
MrRobotoAI/A1 · Warm · 8B · 8K · 0 · 1
mlfoundations-dev/ot3_300k_ckpt-epoch4 · Warm · 8B · 32K · 0 · 1
Yihong7788/qwen2.5-2wiki-kg-sft-300 · Warm · 8B · 32K · 0 · 1
Jennny/llama3_8b_sft_helpsteer · Warm · 8B · 32K · 0 · 1
MergeBench-gemma-2-9b/gemma-2-9b_aya_2epoch · Warm · 9B · 16K · 0 · 1
sparkle-reasoning/SparkleRL-7B-Stage2-mix · Warm · 8B · 32K · 0 · 1