seed_math_automathtext_reasoninghp
seed_math_open2math_reasoninghp
multiple_samples_majority_consensus_pick_one_numina_aime_math_verify
difficulty_sorting_easy_seed_code
difficulty_sorting_high_seed_code
difficulty_sorting_random_seed_code
stratos_verified_mix_epochs2
seed_math_multiple_samples_scale_up_scaredy_cat_all
llama_openthoughts_sorted
instruction_filtering_scale_up_code_base_askllm_16K
instruction_filtering_scale_up_code_base_fasttext_per_domain_16K
Qwen2.5-7B-Instruct-userfeedback-SFT
Qwen2.5-7B-Instruct-userfeedback-SFT-SPIN-iter1
Qwen2.5-7B-Instruct_openthoughts3_300k_annotated_Qwen3-32B
openthoughts3_100k_llama3
openthoughts3_30k_llama3
openthoughts3_1k_llama3
llama_8b_unlearned_unbalanced_gender_1e-6_1.0_0.25_0.5_epoch3
Qwen2.5-7B-Instruct_openthoughts3_math_100k_annotated_QwQ-32B
e1_math_all_qwq_together
Qwen2.5-7B-Instruct_qwq_mix_qwen3_science
llama_8b_unlearned_unbalanced_gender_2nd_1e-6_1.0_0.05_0.15_0.25_epoch1
e1_science_longest_phi
llama_8b_unlearned_unbalanced_gender_2nd_5e-7_1.0_0.5_0.25_0.5_epoch2
Qwen2.5-7B-Instruct-ultrafeedback-11k
Qwen2.5-7B-Instruct-wildfeedback-11k
llama-3.1-8b-eppc-annotator-filtered
glm46-glaive-code-assistant-sandboxes-maxeps-131k
InjecAgent-Llama-3.1-8B-Instruct-optim-fix-10
InjecAgent-Llama-3.1-8B-Instruct-optim-fix-15
your-model-name
krx_Llama3.1_8b_instruct_M1_all_data_sg
krx_Llama3.1_8b_instruct_M3_all_data_sg
InjecAgent-Llama-3.1-8B-Instruct-optim-fix-5
llama_3_unsafe_helpful
vetllm-mistral-7b-merged-book-3
EagleX_1-7T
openchat-3.6-ko-sft
top_9_ranking_stackexchange
top_17_ranking_stackexchange
simpo-evol_tt_5s
simpo-oh_teknium_scaling_down_ratiocontrolled_0.9