oh_v1.3_camel_math_x.25
llama3_non_delete_rr40k_2e6_bz32_ep3
llama3-1_8b_mlfoundations-dev-stackexchange_puzzling
llama3-1_8b_mlfoundations-dev-stackoverflow_25000tasks_0p
llama3-1_8b_mlfoundations-dev-stackoverflow_10000tasks__5p
llama3-1_8b_mlfoundations-dev-stackoverflow_25000tasks__5p
llama3_openmath_1m_ep1
stackoverflow_5000tasks_.75p
stackoverflow_10000tasks_1p
multi-turn-Jan5
llama3-1_8b_webinstruct_750k
original_tiger_dataset_small
llama3-1_8b_math_50000_samples
top_1_ranking_stackexchange
top_3_ranking_stackexchange
llama3_sft_balanced_rr60k_train_on_corr_ep3
Llama3-GSM8K-w2c74.5K-c175K-c2c40K-3ep
top_8_ranking_stackexchange
top_6_ranking_stackexchange
top_7_ranking_stackexchange
llama3_orm_tmp10_2
infoNCA_ultrafeedback_alpha_1e-2_update_401_online
llama3_8b_chat_msj_reptune_bigger_mixed1
de-v3.1
oh_v1.3_evol_instruct_x8
llama3-1_8b_physics_100000_samples
simpo-oh-dcft-v1.3_no-curation_gpt-4o-mini_scale_8x
simpo-oh_v3.1_wo_camel_ai_math
simpo-stackexchange_christianity
top_13_ranking_stackexchange
top_20_ranking_stackexchange
mlfoundations-dev_stackoverflow_500000_samples
0128teacher_checkpoint_0
0128student_checkpoint_0
Reasoning-Llama-3.1-CoT-RE1
lora_9feb_llama8b_deepseek_backdoor
Qwen2.5-Coder-7B-Instruct-20-v2
math-stratos-verified-scaled-0.125
math-stratos-unverified-scaled-0.125
mlfoundations-dev_code-stratos-verified-scaled-0_125_stratos_7b
mlfoundations-dev_code-stratos-unverified-scaled-1_stratos_7b
llama3-1_8b_r1_annotated_math