16
llama3-8B-Instruct_MIFT-ja_manywords_2000
llama3-1_8b_physics_500000_samples
oh_scale_x.125_compute_equal
oh_scale_x.25_compute_equal
oh_scale_x2_compute_equal
try9
try24
try8
llama3_mammoth_dcft_ablation_original_50k
sn29_z2m4_ezwv
Police_Model
sn29_w1m1_h9i7
sn29_x1m6_etuc
s1K_llama3.1_8b_32kcontext
sn29_x1m4_ghvn
Qwen2.5-7B-Open-R1-Distill
DCFT-Stratos-Verified-114k-7B-4gpus-systemprompt-packing
Qwen2-7B-sft-ultrachat-safeRLHF
stratos_unverified_mix_2nodes
stratos-unverified-mix-scaled-0.25
stratos-verified-mix-scaled-0.25
math-stratos-unverified-scaled-0.5
math-stratos-verified-scaled-0.5
math-stratos-verified-scaled-1
verified_stratos_mix_below_16384_cutoff_without_metadata
LIMO_32B
s1K_32b_v2
qwen_s1ablation_length_filter_1k
LIMO_limoconfigs_16k
MedicalEDI-14b-EDI-Base-1
Llama-3.1-8B-sft-ultrachat-hhrlhf
qwen_s1ablation_diversity_sampling_27k
Isabelle_FVELer_SFT
DSR1-Qwen-32B-DSR1-Qwen-32B-131fad2c
qwen_s1ablation_length_filter_9k_10e
Llama-3.3-Illya
Llama-3.3-Utsukushi-Alpha
Llama-3.3-Ellie
Qwen2.5-14B-Instruct-131K
Forgotten-Safeword-24B-V3.0
agent_router_training_conversation_model_Qwen_14B