Models (14,958)

yasmine777 | nn | Warm | 8B | 32K | 0 · 0
Yuuta208 | Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-task_arithmetic-26 | Warm | 8B | 32K | 0 · 0
MrRobotoAI | 110 | Warm | 8B | 8K | 0 · 0
MergeBench-Llama-8B-it | llama3-8b-it-GRPO-after-sft | Warm | 8B | 32K | 0 · 0
mlfoundations-dev | openthoughts3_100k_buggy | Warm | 8B | 32K | 0 · 0
luckeciano | Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabel | Warm | 8B | 32K | 0 · 0
ZMC2019 | Qwen7B-L28-Flat-tuned | Warm | 8B | 32K | 0 · 0
MergeBench-gemma-2-9b-it | gemma-2-9b-it_wildguard_jailbreak_2epoch | Warm | 9B | 16K | 0 · 0
ZMC2019 | OpenR1-Qwen-7B-nsa-B1024-hwtrue | Warm | 8B | 32K | 0 · 0
MergeBench-Llama-8B-it | llama-3.1-8b-it_tulu-3-sft-personas-instruction-following_epoch3_0429 | Warm | 8B | 32K | 0 · 0
luckeciano | Qwen-2.5-7B-GRPO-NoKL-1e-05-24 | Warm | 8B | 32K | 0 · 0
ybq0509 | sa_Q_7B_ckpt2250 | Warm | 8B | 32K | 0 · 0
LNGYEYXR | Llama-3.1-8B-lora-step30 | Warm | 8B | 32K | 0 · 0
dslighfdsl | Llama-3.1-8B-Instruct-SFT-CoT-short | Warm | 8B | 32K | 0 · 0
agg-shambhavi | MimicLlama-3.1-8B-DPO | Warm | 8B | 32K | 0 · 0
bharatwokelo | Qwen-8b-finetuned-website-v3-merged-peft | Warm | 8B | 32K | 0 · 0
wasmdashai | wasmai-7b-v1 | Warm | 8B | 32K | 2 · 0
LNGYEYXR | Llama-3.1-8B-lora-pt-new | Warm | 8B | 32K | 0 · 0
lihengma | Qwen-2.5-7B-Instruct_2wiki_kg_sfted | Warm | 8B | 32K | 0 · 0
shariar076 | Llama-3.1-8B-Instruct-DPO-100R0L-PoliTune | Warm | 8B | 8K | 0 · 0
toufImed | Meta-Llama-3.1-8B-Instruct-finetuned_new | Warm | 8B | 32K | 0 · 0
MrRobotoAI | L1 | Warm | 8B | 8K | 0 · 0
ybq0509 | sd_Q_7B_ckpt2250 | Warm | 8B | 32K | 0 · 0
mlfoundations-dev | a1_science_stackexchange_physics_1k | Warm | 8B | 32K | 0 · 0
Yihong7788 | qwen2.5-hotpotqa-sft-300 | Warm | 8B | 32K | 0 · 0
mlfoundations-dev | openthoughts3_300k_ckpts | Warm | 8B | 32K | 0 · 0
LNGYEYXR | Llama-3.1-8B-lora-pt | Warm | 8B | 32K | 0 · 0
BoltMonkey | boltmonkey_shortreasoning-8b | Warm | 8B | 32K | 0 · 0
Yuuta208 | Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-dare_ties-29 | Warm | 8B | 32K | 0 · 0
shanchen | ds-limo-linearja-250 | Warm | 8B | 32K | 0 · 0
Yuuta208 | Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-ties-29 | Warm | 8B | 32K | 0 · 0
noirchan | Qwen2.5-Coder-7B_math_mergeTIES | Warm | 8B | 32K | 0 · 0
shanchen | ds-limo-1.1-250 | Warm | 8B | 32K | 0 · 0
CompassioninMachineLearning | May3_PLORA_4_5thanimals_10kdata | Warm | 8B | 32K | 0 · 0
pxyyy | Llama3.1-8B-pxyyy-autoif-20k-1-1e-5 | Warm | 8B | 32K | 0 · 0
Yuuta208 | Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-della-27 | Warm | 8B | 32K | 0 · 0
Lansechen | Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0515-v2 | Warm | 8B | 32K | 0 · 0
nguyenvuvn | lla2m0a112 | Warm | 8B | 32K | 0 · 0
chansung | Qwen2.5-7B-CCRL-2 | Warm | 8B | 32K | 0 · 0
mothnaZl | long-sr-Qwen2.5-7B-Instruct | Warm | 8B | 32K | 0 · 0
pxyyy | Qwen2.5-7B-mix-math-dolly-numina-20k-1-1e-6 | Warm | 8B | 32K | 0 · 0
luckeciano | Qwen-2.5-7B-RL-GRPO-Extreme-NoKL-1e-05-25 | Warm | 8B | 32K | 0 · 0