Models

40,032
mlfoundations-dev · Warm · 8B · 32K · evolinstruct_seeding_stackexchange_codegolf · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · camel_seeding_stackexchange_codegolf · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · llama3_mammoth_dcft_ablation_50k · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · seed_math_allenai_math · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · seed_math_open2math · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · mlfoundations-dev_stackoverflow_375000_samples · 0 · 2
NalDice · Warm · 70B · 32K · askvox-llama3.3-70b-16bit · 0 · 2 · Jan 2025
hendrydong · Warm · 8B · 32K · ckpt-t-1115 · 0 · 2
burgasdotpro · Warm · 8B · 32K · bgGPT-Qwen2.5-Math-7B-Inst · 1 · 2
mlfoundations-dev · Warm · 8B · 32K · dpo_from_stratos_judged_annotated_rejected_responses · 1 · 2
MrezaPRZ · Warm · 8B · 32K · picker_qwen · 0 · 2
AmberYifan · Warm · 8B · 32K · Qwen2.5-7B-sft-ultrachat-safeRLHF · 0 · 2
netcat420 · Warm · 8B · 32K · DeepSeek-R1-Distill-Qwen-MFANN-Slerp-7b · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · Bespoke-Stratos-17k-v4 · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · qwen_7b_instruct_extra_verified · 0 · 2
burgasdotpro · Warm · 8B · 32K · bgGPT-DeepSeek-R1-Distill-Qwen-7B · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · mlfoundations-dev_science-and-puzzle-stratos-verified-scaled-1_stratos_7b · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · mlfoundations-dev_code-stratos-verified-scaled-0_25_stratos_7b · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · mlfoundations-dev_code-stratos-unverified-scaled-0_125_stratos_7b · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · mlfoundations-dev_code-stratos-unverified-scaled-0_25_stratos_7b · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · mlfoundations-dev_code-stratos-unverified-scaled-0_5_stratos_7b · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · llama3-1_8b_r1_annotated_aops · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · llama3-1_8b_4o_annotated_olympiads · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · dolphinr1 · 2 · 2
mlfoundations-dev · Warm · 33B · 32K · s1K_32b · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · mlfoundations-dev_stratos-verified-mix-scaled-0_5_stratos_7b · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · seed_math_tiger_math_reasoninghp · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · multiple_samples_sharpening_numina_aime · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · multiple_samples_none_numina_aime_adjusted_samples · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · difficulty_sorting_high_seed_code · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · stratos_verified_plus_s1r1 · 0 · 2
bulkbeings · Warm · 8B · 32K · llama3.1-2eph-a100-all · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · stratos_verfied_v2_1 · 0 · 2
mlfoundations-dev · Warm · 8B · 32K · qwen2-5_sky_t1_2-5k_base · 0 · 2 · Feb 2025
mlfoundations-dev · Warm · 8B · 32K · seed_math_multiple_samples_scale_up_scaredy_cat_baseline · 0 · 2
johnpaulbin · Warm · 8B · 32K · tokiiii · 0 · 2
alexxi19 · Warm · 12B · 32K · ft-v1-nemo-base-merge-v1 · 0 · 2
Jianyuan1 · Warm · 14B · 32K · deepseek-r1-14b-cot-math-reasoning-full · 2 · 2
arazizml · Warm · 33B · 32K · sft_trainer · 0 · 2
secmlr · Warm · 8B · 32K · VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5 · 0 · 2
Maker-0409 · Warm · 8B · 32K · Qwen-2.5-7B-Simple-RL · 0 · 2
fangyili · Warm · 8B · 32K · deepseek-distill-qwen-7b-merged-peft · 0 · 2