Qwen2.5-7B-Instruct-SFT-Pubmed-16bit-DFT
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-tesla-ver10
meta-llama-Llama-3.1-8B-Instruct-sanitization-clean-OPI_SEP-42-202601102333
instruct_hpsearch_lr_3.0e-06_0
jan13_8-8-1_sdf
affine-g15-5EhM3q9z5Yj4Vf2sgUSEbBTuqCvdMqQvFrnA3N9ZHnbxv7jG
ee_lm8_grpo
HaiJava-Surgeon-Qwen2.5-Coder-7B-SFT-v1
DynaGuard-8B-6750
mistral-7b-utterance
oh_v1_w_v3_camel_chemistry_gpt-4o-mini
oh_v1_w_v3_evol_instruct
OH_DCFT_V3_wo_dataforge_economics
OH_original_wo_metamath_40k
OH_original_wo_platypus
OH_original_wo_slimorca_550k
oh_v1_w_v3_camel_biology_gpt-4o-mini
oh_v1_w_v3_opengpt
oh-dcft-v1.2_no-curation_gpt-4o-mini_wo_camel_biology
oh_v1-2_only_evol_instruct
oh_v1-2_only_camel_chemistry
oh_v3-1_only_caseus_custom
oh_v3-1_only_dataforge_economics
oh_v3-1_only_glaive_code_assistant
oh_v3-1_only_cot_alpaca
oh_v3-1_only_gpt4_llm
airoboros_none_resp_gpt-4o-mini_inst_gpt-4o_resp
oh_v1.3_airoboros_x4
oh_v1.3_airoboros_x8
oh_v1.3_alpaca_x4
oh_v1.3_camel_chemistry_x4
oh_v1.3_camel_math_x8
oh_v1.3_opengpt_x.125
oh_v1.3_opengpt_x4
oh_v1.3_opengpt_x.25
oh_v1.3_slim_orca_x.25
oh_v1.3_slim_orca_x.5
oh_v1.3_unnatural_instructions_x.25
oh_v1.3_unnatural_instructions_x2
oh_v1.3_unnatural_instructions_x.125
hp_ablations_llama3_epoch3_dcftv1.2
oh_v1.3_airoboros_x.5