oh_scale_x.125_compute_equal
oh_scale_x.25_compute_equal
oh_scale_x2_compute_equal
try9
try24
try8
llama3_mammoth_dcft_ablation_original_50k
sn29_z2m4_ezwv
Police_Model
sn29_w1m1_h9i7
s1K_llama3.1_8b_32kcontext
DCFT-Stratos-Verified-114k-7B-4gpus-systemprompt-packing
Qwen2-7B-sft-ultrachat-safeRLHF
stratos_unverified_mix_2nodes
stratos-unverified-mix-scaled-0.25
stratos-verified-mix-scaled-0.25
math-stratos-unverified-scaled-0.5
math-stratos-verified-scaled-0.5
math-stratos-verified-scaled-1
verified_stratos_mix_below_16384_cutoff_without_metadata
LIMO_32B
s1K_32b_v2
qwen_s1ablation_length_filter_1k
LIMO_limoconfigs_16k
Llama-3.1-8B-sft-ultrachat-hhrlhf
qwen_s1ablation_diversity_sampling_27k
qwen_s1ablation_length_filter_9k_10e
Llama-3.3-Utsukushi-Alpha
Llama-3.3-Ellie
Meta-Llama-3.1-8B-Instruct-PUG-hc-playbook-3epochs-2e-5
gemma-3-4B-function-calling-v0.4
Magistral-Small-2506
phi_30K_qwq_0K
qwen3-14b-ug40-pretrained
Llama-3.3-70B-Aster-v0-stage3
qwen25coder-14b-end2end_sonnet_combined_maxstep40_sft-32k_bz8_epoch2_lr1en5-v1
openthoughts3_100k_buggy
MimicLlama-3.1-8B-DPO
Llama-3.1-8B-Instruct-DPO-100R0L-PoliTune
codenames-14b-sft
a1_science_stackexchange_physics_1k
openthoughts3_300k_ckpts