Models

39,633
TOMFORD79Warm3B32K

model17

0
·
0
lihengmaWarm8B32K

Qwen-2.5-7B-Instruct_2wiki_kg_sfted

0
·
0
albertfaresWarm800M32K

DPO_MCQA_model

0
·
0
shariar076Warm8B8K

Llama-3.1-8B-Instruct-DPO-100R0L-PoliTune

0
·
0
toufImedWarm8B32K

Meta-Llama-3.1-8B-Instruct-finetuned_new

0
·
0
MrRobotoAIWarm8B8K

L1

0
·
0
ybq0509Warm32B32K

sc_Q_32B_ckpt1124

0
·
0
ybq0509Warm8B32K

sd_Q_7B_ckpt2250

0
·
0
mlfoundations-devWarm8B32K

a1_science_stackexchange_physics_1k

0
·
0
juhwWarm3B32K

q4104

0
·
0
Yihong7788Warm8B32K

qwen2.5-hotpotqa-sft-300

0
·
0
mlfoundations-devWarm8B32K

openthoughts3_300k_ckpts

0
·
0
LNGYEYXRWarm8B32K

Llama-3.1-8B-lora-pt

0
·
0
BoltMonkeyWarm8B32K

boltmonkey_shortreasoning-8b

0
·
0
Yuuta208Warm8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-dare_ties-29

0
·
0
shanchenWarm8B32K

ds-limo-linearja-250

0
·
0
Yuuta208Warm8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-ties-29

0
·
0
r2e-editsWarm14B32K

qwen3_14b_sft_swesmith_r2e_v2_qwen3_format_32k_maxstep40_rft-20k_bz8_epoch2_lr1en5-v1

0
·
0
noirchanWarm8B32K

Qwen2.5-Coder-7B_math_mergeTIES

0
·
0
shanchenWarm8B32K

ds-limo-1.1-250

0
·
0
CompassioninMachineLearningWarm8B32K

May3_PLORA_4_5thanimals_10kdata

0
·
0
pxyyyWarm8B32K

Llama3.1-8B-pxyyy-autoif-20k-1-1e-5

0
·
0
Yuuta208Warm8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-della-27

0
·
0
LansechenWarm8B32K

Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0515-v2

0
·
0
shallow6414Warm27B32K

sn11-3-5-1

0
·
0
nguyenvuvnWarm8B32K

lla2m0a112

0
·
0
chansungWarm8B32K

Qwen2.5-7B-CCRL-2

0
·
0
mothnaZlWarm8B32K

long-sr-Qwen2.5-7B-Instruct

0
·
0
pxyyyWarm8B32K

Qwen2.5-7B-mix-math-dolly-numina-20k-1-1e-6

0
·
0
mlfoundations-devWarm33B32K

openr1_32B

0
·
0
luckecianoWarm8B32K

Qwen-2.5-7B-RL-GRPO-Extreme-NoKL-1e-05-25

0
·
0
SrinivastlWarm4B4K

NyayaMitra

1
·
0
alvinmingWarm8B32K

es-qwen-math-base-7b-3k-stage2-6k-t2-ds_o2-step400

0
·
0
lihengmaWarm8B32K

Qwen-2.5-7B-Instruct_2wiki_text_sfted

0
·
0
AmberYifanWarm8B32K

Qwen2.5-7B-sft-ultrachat

1
·
0
OyasiWarm8B32K

msdialect

0
·
0
secmlrWarm8B32K

SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_7b-433-enriched-3in1

0
·
0
HINT-labWarm4B32K

Qwen3-4B-Baseline-SFT

0
·
0
finvixWarm500M32K

qwen-2.5-0.5B

0
·
0
HINT-labWarm8B32K

Qwen2.5-7B-Baseline-SFT

0
·
0
simonyclWarm4B32K

Qwen3-4B-SFT-KuhnPoker-step_250

0
·
0
nate-rahnWarm8B32K

0620-sft_vanilla_all_principles_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs

0
·
0