original_tiger_dataset_small
llama3-1_8b_math_50000_samples
top_1_ranking_stackexchange
top_3_ranking_stackexchange
llama3_sft_balanced_rr60k_train_on_corr_ep3
Llama3-GSM8K-w2c74.5K-c175K-c2c40K-3ep
top_8_ranking_stackexchange
top_6_ranking_stackexchange
top_7_ranking_stackexchange
llama3_orm_tmp10_2
infoNCA_ultrafeedback_alpha_1e-2_update_401_online
llama3_8b_chat_msj_reptune_bigger_mixed1
oh_v1.3_evol_instruct_x8
llama3-1_8b_physics_100000_samples
simpo-oh-dcft-v1.3_no-curation_gpt-4o-mini_scale_8x
simpo-oh_v3.1_wo_camel_ai_math
simpo-stackexchange_christianity
top_13_ranking_stackexchange
top_20_ranking_stackexchange
mlfoundations-dev_stackoverflow_500000_samples
0128teacher_checkpoint_0
0128student_checkpoint_0
Reasoning-Llama-3.1-CoT-RE1
Qwen2.5-Coder-7B-Instruct-20-v2
math-stratos-verified-scaled-0.125
math-stratos-unverified-scaled-0.125
mlfoundations-dev_code-stratos-verified-scaled-0_125_stratos_7b
mlfoundations-dev_code-stratos-unverified-scaled-1_stratos_7b
llama3-1_8b_r1_annotated_math
mlfoundations-dev_code-stratos-verified-scaled-0_5_stratos_7b
llama3-1_8b_multiple_samples_all_numina_aime
llama3-1_8b_multiple_samples_majority_consensus_numina_aime
multiple_samples_majority_consensus_numina_aime_math_verify
mlfoundations-dev_stratos-verified-mix-scaled-1_stratos_7b
seed_math_automathtext_reasoninghp
seed_math_open2math_reasoninghp
multiple_samples_majority_consensus_pick_one_numina_aime_math_verify
difficulty_sorting_easy_seed_code
difficulty_sorting_high_seed_code
difficulty_sorting_random_seed_code
stratos_verified_mix_epochs2
seed_math_multiple_samples_scale_up_scaredy_cat_all