llama3_8b_baseline_instructskillmix
llama-3-8b-bnfx-finetune
llama-3.1-8B-thesis-aligned
C1-2
C1-3
OH_original_wo_airoboros
oh-dcft-v1.2_no-curation_gpt-4o-mini_wo_opengpt
oh-dcft-v1.2_no-curation_gpt-4o-mini_wo_metamath
prm_gsm_2k_with_full_sol_mix_ref_redistribution_hf
internal_audit_new
oh_v3-1_only_evol_instruct_140k
oh_v1.3_unnatural_instructions_x.5
oh_v1.3_evol_instruct_x.5
stackexchange_codereview
Llama-3.3-70B-Memo-law-Instruct-v2
stackexchange_earthscience
infoNCA_ultrafeedback_update_201
stackexchange_mathematica
stackexchange_vegetarianism
stackexchange_webapps
Llama-3.3-70B-Memo-law-Instruct-v1
Llama-3.1-8B-Instruct-D1DPO_2048
mergekit-model_stock-bzcrthr
ko-Meta-Llama-3.1-8B-Instruct
Wisedom-8B-EmbeddingReordering
ofdbase
simpo-oh_teknium_scaling_down_random_0.4
Llama-3.3-Argunaut-1-70B-SFT
llama_instruct_adult_seed_42
llama3-1_8b_codefeedback
llama3-1_8b_dolphin
llama3-1_8b_share_gpt_code
seed_math_tiger_math
7B-v0.2
DEFUNCT-EXPERIMENT2_2
Deeepseek-QwenSlerp4-32B
llama3-1_8b_4o_annotated_aime
llama3-1_8b_4o_annotated_aops
s1K_reformat
qwen2-5_sky_t1_2-5k_alternative_r1_distill_llama70b
qwen2-5_sky_t1_2-5k_rewrite_r1_distill_llama70b
llama3.1-2eph-a100-all