oh-dcft-v1.3_no-curation_gpt-4o-mini_scale_2x
top_2_ranking_stackexchange
llama3-open-ko-8b-shimshimi
llama3-open-ko-8b-Instruct-shimshimi-500-ver2
Llama-3.1-8B-InitializedEmbeddings_with_Hermes-3
oh-dcft-v3.1-SN-405B-hacky
top_10_ranking_stackexchange
L3.1-Artemis-g-8B
llama3.1_korean_v1.3_sft_by_aidx
infoNCA_ultrafeedback_alpha_1e-2_update_401_online
llama3_8b_chat_msj_reptune_bigger_mixed2
de-v3.1
ko-Meta-Llama-3.1-8B-Instruct
reasoning_sft_uf_dp_1k3k_lr_1e-6_gas_16_1735956551
ckpt-0110-v2
de-v3.3
de-v3.5
oh-dcft-v3.1-llama-3.1-405b-v2dummytesting
simpo-stackoverflow_25000tasks_1p
oh_scale_x4_compute_equal
open-o1-sft-original-plus-oh-v3.1
sky-t1-original-llama-instruct
top_11_ranking_stackexchange
llama-3-8b-Instruct_ftjob-2581e9f8d338
alpaca_seeding_stackexchange_codegolf
evolinstruct_seeding_stackexchange_codegolf
llama3_mammoth_dcft_ablation_50k
seed_math_allenai_math
seed_math_open2math
seed_math_tiger_lab_math
mlfoundations-dev_stackoverflow_50000_samples
mlfoundations-dev_stackoverflow_375000_samples
ckpt-t-1115
bgGPT-Qwen2.5-Math-7B-Inst
Bespoke-Stratos-17k-v3
dpo_from_stratos_judged_annotated_rejected_responses
Bespoke-Stratos-17k-v4
qwen_7b_instruct_extra_verified
mlfoundations-dev_science-and-puzzle-stratos-verified-scaled-1_stratos_7b
mlfoundations-dev_code-stratos-verified-scaled-0_25_stratos_7b
mlfoundations-dev_code-stratos-unverified-scaled-0_125_stratos_7b
mlfoundations-dev_code-stratos-unverified-scaled-0_25_stratos_7b