math-stratos-verified-scaled-1
verified_stratos_mix_below_16384_cutoff_without_metadata
qwen_s1ablation_length_filter_1k
Llama-3.1-8B-sft-ultrachat-hhrlhf
qwen_s1ablation_diversity_sampling_27k
qwen_s1ablation_length_filter_9k_10e
Meta-Llama-3.1-8B-Instruct-PUG-hc-playbook-3epochs-2e-5
phi_30K_qwq_0K
openthoughts3_100k_buggy
a1_science_stackexchange_physics_1k
openthoughts3_300k_ckpts
Qwen2.5-7B-sft-ultrachat
Qwen2.5-7B-Baseline-SFT
0620-sft_vanilla_all_principles_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
Llama-3.1-8B-sft-SPIN-gpt4o-ORPO
0615-sft_info_wc_multi_attrs-qwen3_8b_base-7_epochs
Llama-3.1-8B-sft-SPIN-Llama-3.1-70B-Instruct-KTO
Synthesizer-8B-math
Llama-3.1-8B-sft-ultrachat-SPIN-gpt4o
Bio-Medical-Llama-3-8B-CoT-012025
keval-2-9b
Llama-3.1-8B-sft-gen-dpo-10k-beta0.7-lr5e-7
0619-sft_vanilla_no_sexism_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
Llama-3.1-8B-sft-peers-pool-IPO
affine-01-5DSHBVivsm4fbhRULpRL4897uncVU1wGj2f2ETEDGdrDU9JS
affine-4-5CtDhg8C3LHkLSsfzE5hMBoiBZG2Bvn9M5JFssvmdDeRuXSs
Meta-Llama-3.1-8B-Instruct-medical_s669_lr1em05_r32_a64_e1
Affine-af4
llama-3.1-8B-Instruct-FT-0.3
gemma9b-cot-tr-merged
affine-06-5ECmgtFtDFmEronjQ6wpcYjmNsdDukJyavrSUou5CQrnT7te
qwen3-8b-bfcl-sft-merged
Llama-3-ELYZA-JP-8B
Llama-3-Indian-Gender-Classifier
nl2bash-swesmith-undr7030
team-leader-mistral-7b
qwen_finetune_16bit
PK-Link-Qwen3-8B-SFT-GRPO
PK-Link-Qwen3-8B-SFT-GRPO-0_02-kl_step_40
RevUtil_merged_model
equational-reasoning-sft-rl-loop-theory
qwen-instruct-synthetic_1_stem_only