stratos-unverified-mix-scaled-0.25
stratos-verified-mix-scaled-0.25
math-stratos-unverified-scaled-0.5
math-stratos-verified-scaled-0.5
math-stratos-verified-scaled-1
verified_stratos_mix_below_16384_cutoff_without_metadata
LIMO_32B
s1K_32b_v2
qwen_s1ablation_length_filter_1k
LIMO_limoconfigs_16k
qwen_s1ablation_diversity_sampling_27k
qwen_s1ablation_length_filter_9k_10e
OpenBuddy-R1-0528-Distill-Qwen2.5-72B-Preview0
phi_30K_qwq_0K
openthoughts3_100k_buggy
a1_science_stackexchange_physics_1k
openthoughts3_300k_ckpts
Qwen2.5-7B-sft-ultrachat
Qwen2.5-7B-Baseline-SFT
0620-sft_vanilla_all_principles_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
Omega-Darker_The-Final-Directive-14B
0619-sft_vanilla_no_sexism_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
sft_models-DeepSeek-R1-Distill-Qwen-32B-cwepy10-cwe-checkpoint-12
Qwen2.5-32B-Instruct_medical_mlp-down_full
Qwen2.5-32B-Instruct_medical_attention-kv_resp
Qwen2.5-32B-Instruct_medical_mlp_resp
Qwen2.5-32B-Instruct_medical_mlp_full
Qwen2.5-32B-Instruct_medical_all_resp
Qwen2.5-32B-Instruct_insecure_all_resp
Qwen2.5-32B-Instruct_medical_mlp-down_resp
Qwen2.5-32B-Instruct_medical_attention_full
Qwen2.5-32B-Instruct_medical_attention_resp
Qwen2.5-32B-Instruct_auto_all_resp
sft_models-DeepSeek-R1-Distill-Qwen-32B-cwepy10-checkpoint-12
qwen2-5-3b-ins-qwen2-5-7b-ins-basic-newprompt-fp32-0324
qwen2-5-14b-ins-qwen2-5-7b-ins-basic-newprompt-0328
codesentinel-full
hackwatch-monitor