llama-7b-ria-40pct
llama-7b-ria-30pct
Llama3-8B-SFT-SyntheticMedical-bnb-4bit
invoice-structured-extraction-sft
Collaiborator-MEDLLM-Llama-3-8B-v1
OH_original_wo_null_sources
OpenHermes-2.5-sedrick
9
16
llama3-1_8b_physics_500000_samples
oh_scale_x.125_compute_equal
oh_scale_x.25_compute_equal
oh_scale_x2_compute_equal
try9
try24
try8
llama3_mammoth_dcft_ablation_original_50k
Police_Model
s1K_llama3.1_8b_32kcontext
Qwen2.5-7B-Open-R1-Distill
DCFT-Stratos-Verified-114k-7B-4gpus-systemprompt-packing
Qwen2-7B-sft-ultrachat-safeRLHF
stratos_unverified_mix_2nodes
stratos-unverified-mix-scaled-0.25
stratos-verified-mix-scaled-0.25
math-stratos-unverified-scaled-0.5
math-stratos-verified-scaled-0.5
math-stratos-verified-scaled-1
verified_stratos_mix_below_16384_cutoff_without_metadata
qwen_s1ablation_length_filter_1k
Llama-3.1-8B-sft-ultrachat-hhrlhf
qwen_s1ablation_diversity_sampling_27k
qwen_s1ablation_length_filter_9k_10e
Meta-Llama-3.1-8B-Instruct-PUG-hc-playbook-3epochs-2e-5
phi_30K_qwq_0K
openthoughts3_100k_buggy
a1_science_stackexchange_physics_1k
openthoughts3_300k_ckpts
Qwen2.5-7B-sft-ultrachat
Qwen2.5-7B-Baseline-SFT
0620-sft_vanilla_all_principles_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
Llama-3.1-8B-sft-SPIN-gpt4o-ORPO