airoboros_none_resp_gpt-4o-mini_inst_gpt-4o_resp
llama3-8b-final-ppo-clean-v0.1
OH_DCFT_V3_wo_collective_cognition
OH_original_wo_evol_instruct_70k
autotrain-llama-1-merged
llama3-1-ox-llms-8b-sft-only-germany-data-and-ultrafeedback
oh_v1.3_camel_chemistry_x4
oh_v1.3_slim_orca_x2
HikariBloom-v0.3-RP
rlhflow_mixture_clean_empty_round_with_dart_intuitive_sampled-20k-nolisa-2e-5-bs64
OH_DCFT_V3_wo_glaive_code_assistant
oh_v1.3_camel_biology_x2
oh_v1.3_opengpt_x2
oh_v1.3_slim_orca_x.125
oh_v1.3_slim_orca_x.25
oh_v1.3_slim_orca_x.5
oh_v1.3_unnatural_instructions_x4
llama3.1-8B_v2_model_16bit
oh-dcft-v3.1-llama-3.1-8b
hp_ablations_llama3_epoch2_dcftv1.2
hp_ablations_llama3_epoch3_dcftv1.2
oh_v1.3_camel_chemistry_x.25
SFT-base_merged_fp16
ofd1
model_6hs
oh_v1.3_alpaca_x.25
oh_v1.3_camel_biology_x.25
oh_v1.3_camel_chemistry_x8
oh_v1.3_camel_math_x2
llama3-1_8b_mlfoundations-dev-stackexchange_proofassistants
stackexchange_cs
stackexchange_movies
stackexchange_biology
stackexchange_hardwarerecs
oh_v1.3_camel_math_x.5
mergekit-model_stock-anvdilz
llm_model
llama3-1_8b_mlfoundations-dev-stackexchange_scicomp
stackexchange_blender
stackexchange_chinese
stackexchange_crypto
llama3-1_8b_mlfoundations-dev-stackoverflow_25000tasks_1p