Models

38,944
MergeBench-Llama-8B-itCold8B32K

llama3-8b-it-GRPO-after-sft

0
·
0
mlfoundations-devCold8B32K

openthoughts3_100k_buggy

0
·
0
luckecianoCold8B32K

Qwen-2.5-7B-GRPO-NoKL-1e-05-24

0
·
0
agg-shambhaviCold8B32K

MimicLlama-3.1-8B-DPO

0
·
0
wasmdashaiCold8B32K

wasmai-7b-v1

2
·
0
LNGYEYXRCold8B32K

Llama-3.1-8B-lora-pt-new

0
·
0
TOMFORD79Cold3B32K

model17

0
·
0
Shaleen123Cold14B32K

MedicalEDI-14b-EDI-Base-Final

1
·
0
shariar076Cold8B8K

Llama-3.1-8B-Instruct-DPO-100R0L-PoliTune

0
·
0
MrRobotoAICold8B8K

L1

0
·
0
mlfoundations-devCold8B32K

a1_science_stackexchange_physics_1k

0
·
0
mlfoundations-devCold8B32K

openthoughts3_300k_ckpts

0
·
0
Yuuta208Cold8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-dare_ties-29

0
·
0
shanchenCold8B32K

ds-limo-linearja-250

0
·
0
Yuuta208Cold8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-ties-29

0
·
0
riddickzCold8B32K

Llama-3.1-8B-Instruct_kg3.5k_2e5

0
·
0
shanchenCold8B32K

ds-limo-1.1-250

0
·
0
pxyyyCold8B32K

Llama3.1-8B-pxyyy-autoif-20k-1-1e-5

0
·
0
Yuuta208Cold8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-della-27

0
·
0
shallow6414Cold27B32K

sn11-3-5-1

0
·
0
mlfoundations-devCold33B32K

openr1_32B

0
·
0
rchan26Cold14B32K

t0-14B-test

0
·
0
luckecianoCold8B32K

Qwen-2.5-7B-RL-GRPO-Extreme-NoKL-1e-05-25

0
·
0
SrinivastlCold4B4K

NyayaMitra

1
·
0
alvinmingCold8B32K

es-qwen-math-base-7b-3k-stage2-6k-t2-ds_o2-step400

0
·
0
AmberYifanCold8B32K

Qwen2.5-7B-sft-ultrachat

1
·
0
HINT-labCold4B32K

Qwen3-4B-Baseline-SFT

0
·
0
HINT-labCold8B32K

Qwen2.5-7B-Baseline-SFT

0
·
0
nate-rahnCold8B32K

0620-sft_vanilla_all_principles_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs

0
·
0
patrickcmdCold14B32K

qwen3-14b-ug40-merged

0
·
0
weifarCold8B32K

merged_318b_c

0
·
0
mlfoundations-devCold32B32K

QwQ-32B_enable-liger-kernel_False_OpenThoughts3_1k

0
·
0
mlfoundations-devCold8B32K

Qwen2.5-7B-Instruct_openthoughts3_math_100k_annotated_QwQ-32B

0
·
0
WenFenggCold3B32K

guys_6

0
·
0
WenFenggCold3B32K

guys_1

0
·
0
OnDeviceMedNotesCold1B32K

Medical_Summary_Notes

1
·
0
WenFenggCold3B32K

guys_2

0
·
0
HitmanRebornCold3B32K

COffee_C

0
·
0
mlfoundations-devCold32B32K

QwQ-32B_openthoughts3_100k

0
·
0
mlfoundations-devCold32B32K

QwQ-32B_enable-liger-kernel_False_OpenThoughts3_3k

0
·
0
WenFenggCold3B32K

guys_4

0
·
0
WenFenggCold3B32K

guys_5

0
·
0