qwen25-7b
mlfoundations-dev/seed_math_multiple_samples_scale_up_scaredy_cat_baseline
0
jan-hq/Deepseek-Qwen2.5-7B-Redistil
21
qwen25-32b
mlfoundations-dev/mlfoundations-dev_extra_verified-32B
2
mlfoundations-dev/difficulty_sorting_random_seed_code
1
mlfoundations-dev/multiple_samples_none_numina_aime_adjusted_samples
mlfoundations-dev/verified_stratos_mix_no_proofs_without_metadata
qwen2-7b
mlfoundations-dev/dpo_from_multiple_samples_shortest_numina_aime
3
mlfoundations-dev/seed_math_tiger_math_reasoninghp
mlfoundations-dev/mlfoundations-dev_stratos-unverified-mix-scaled-0_5_stratos_7b
mlfoundations-dev/mlfoundations-dev_stratos-verified-mix-scaled-0_5_stratos_7b
mlfoundations-dev/llama3-1_8b_distill_70b_infra_baseline_r1_2.5k
mlfoundations-dev/s1K_reformat
qwen25-14b
rcds/Qwen2.5-14B-Instruct-SLDS
4
mlfoundations-dev/llama3-1_8b_r1_annotated_math
mlfoundations-dev/mlfoundations-dev_code-stratos-verified-scaled-0_125_stratos_7b
mlfoundations-dev/mlfoundations-dev_code-stratos-verified-scaled-1_stratos_7b
mlfoundations-dev/mlfoundations-dev_code-stratos-unverified-scaled-0_25_stratos_7b
mlfoundations-dev/mlfoundations-dev_science-and-puzzle-stratos-verified-scaled-0_5_stratos_7b
mlfoundations-dev/DCFT-Stratos-Unverified-114k-32B
burgasdotpro/bgGPT-DeepSeek-R1-Distill-Qwen-7B