llama-7b-obs-cancel-block-80pct
llama-7b-ria-70pct
llama-7b-obs-cancel-block-70pct
llama-7b-ria-80pct
llama2_7b_chat_medaq_resta_gamma0.3
OPI
qwen3-vl-8b-ac-2-world-model-stage1-full-epoch3-stage2-lora-epoch3
llama2_7b_chat_only_rsn_tuned_lr5e-5_revised
llama-7b-ria-30pct
Llama3-8B-SFT-SyntheticMedical-bnb-4bit
Collaiborator-MEDLLM-Llama-3-8B-v1
OH_original_wo_null_sources
OpenHermes-2.5-sedrick
9
16
llama3-1_8b_physics_500000_samples
oh_scale_x.125_compute_equal
oh_scale_x.25_compute_equal
oh_scale_x2_compute_equal
try9
try24
try8
llama3_mammoth_dcft_ablation_original_50k
Police_Model
s1K_llama3.1_8b_32kcontext
Qwen2.5-7B-Open-R1-Distill
DCFT-Stratos-Verified-114k-7B-4gpus-systemprompt-packing
Qwen2-7B-sft-ultrachat-safeRLHF
stratos_unverified_mix_2nodes
stratos-unverified-mix-scaled-0.25
stratos-verified-mix-scaled-0.25
math-stratos-unverified-scaled-0.5
math-stratos-verified-scaled-0.5
math-stratos-verified-scaled-1
verified_stratos_mix_below_16384_cutoff_without_metadata
qwen_s1ablation_length_filter_1k
Llama-3.1-8B-sft-ultrachat-hhrlhf
qwen_s1ablation_diversity_sampling_27k
qwen_s1ablation_length_filter_9k_10e
Meta-Llama-3.1-8B-Instruct-PUG-hc-playbook-3epochs-2e-5
phi_30K_qwq_0K
openthoughts3_100k_buggy