evolinstruct_seeding_stackexchange_codegolf
camel_seeding_stackexchange_codegolf
llama3_mammoth_dcft_ablation_50k
seed_math_allenai_math
seed_math_open2math
mlfoundations-dev_stackoverflow_375000_samples
askvox-llama3.3-70b-16bit
ckpt-t-1115
bgGPT-Qwen2.5-Math-7B-Inst
dpo_from_stratos_judged_annotated_rejected_responses
picker_qwen
Qwen2.5-7B-sft-ultrachat-safeRLHF
DeepSeek-R1-Distill-Qwen-MFANN-Slerp-7b
Bespoke-Stratos-17k-v4
qwen_7b_instruct_extra_verified
bgGPT-DeepSeek-R1-Distill-Qwen-7B
mlfoundations-dev_science-and-puzzle-stratos-verified-scaled-1_stratos_7b
mlfoundations-dev_code-stratos-verified-scaled-0_25_stratos_7b
mlfoundations-dev_code-stratos-unverified-scaled-0_125_stratos_7b
mlfoundations-dev_code-stratos-unverified-scaled-0_25_stratos_7b
mlfoundations-dev_code-stratos-unverified-scaled-0_5_stratos_7b
llama3-1_8b_r1_annotated_aops
llama3-1_8b_4o_annotated_olympiads
dolphinr1
s1K_32b
mlfoundations-dev_stratos-verified-mix-scaled-0_5_stratos_7b
seed_math_tiger_math_reasoninghp
multiple_samples_sharpening_numina_aime
multiple_samples_none_numina_aime_adjusted_samples
difficulty_sorting_high_seed_code
stratos_verified_plus_s1r1
llama3.1-2eph-a100-all
stratos_verfied_v2_1
qwen2-5_sky_t1_2-5k_base
seed_math_multiple_samples_scale_up_scaredy_cat_baseline
tokiiii
ft-v1-nemo-base-merge-v1
deepseek-r1-14b-cot-math-reasoning-full
sft_trainer
VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5
Qwen-2.5-7B-Simple-RL
deepseek-distill-qwen-7b-merged-peft