stackexchange_stackoverflow
stackoverflow_5000tasks_1p
stackoverflow_10000tasks_1p
oh-dcft-v1.3_no-curation_gpt-4o-mini_scale_2x
9
16
multi-turn-Jan5
llama3-1_8b_webinstruct_750k
original_tiger_dataset_small
top_1_ranking_stackexchange
top_3_ranking_stackexchange
llama3-open-ko-8b-shimshimi
llama3-open-ko-8b-Instruct-shimshimi-500-ver2
llama3_sft_balanced_rr60k_train_on_corr_ep3
Llama3-GSM8K-w2c74.5K-c175K-c2c40K-3ep
llama3-8B-Instruct_PIFT-enja_manywords_2000
llama3-8B-Instruct_MIFT-en_manywords_2000
llama3-8B-Instruct_MIFT-ja_manywords_2000
top_8_ranking_stackexchange
top_6_ranking_stackexchange
top_7_ranking_stackexchange
top_9_ranking_stackexchange
top_17_ranking_stackexchange
llama3_orm_tmp10
llama3_orm_tmp10_2
infoNCA_ultrafeedback_alpha_1e-2_update_401_online
llama3_8b_chat_msj_reptune_bigger_mixed1
llama3_8b_chat_msj_reptune_bigger_mixed2
Llama3-sft-more-corr-rr60k-2ep
Llama3-sft-less-corr-rr60k-2ep
de-v3.1
oh_v1.3_slim_orca_x4
oh_v1.3_evol_instruct_x8
llama3-1_8b_physics_100000_samples
llama3-1_8b_physics_500000_samples
llama3-1_8b_math_500000_samples
de-v3.3
oh_scale_x.125_compute_equal
oh_scale_x.25_compute_equal
oh_scale_x2_compute_equal
simpo-evol_tt_5s
simpo-oh_teknium_scaling_down_ratiocontrolled_0.9