sn11-3-5-1
Qwen2.5-7B-sft-ultrachat
Qwen2.5-7B-Baseline-SFT
0620-sft_vanilla_all_principles_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
qwen3-14b-ug40-merged
QwQ-32B_openthoughts3_100k
QwQ-32B_enable-liger-kernel_False_OpenThoughts3_3k
Kraken
Llama-3.1-8B-sft-SPIN-gpt4o-ORPO
0615-sft_info_wc_multi_attrs-qwen3_8b_base-7_epochs
Llama-3.1-8B-sft-SPIN-Llama-3.1-70B-Instruct-KTO
Synthesizer-8B-math
Llama-3.1-8B-sft-ultrachat-SPIN-gpt4o
DarkThoughts-V3-LLaMa-70B
keval-2-9b
Llama-3.1-8B-sft-gen-dpo-10k-beta0.7-lr5e-7
0619-sft_vanilla_no_sexism_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
Llama-3.1-8B-sft-peers-pool-IPO
add_vision_3
gemma3-12b-ug40-pretrained
3.2_magistral_ties_merging
llama-2-13B-chat-hf-finetune-klaid
llama-2-13b-chat-hf-finetune_law-total
Gradients-Instruct-V2
VLM-iter_0001000
affine-01-5DSHBVivsm4fbhRULpRL4897uncVU1wGj2f2ETEDGdrDU9JS
affine-4-5CtDhg8C3LHkLSsfzE5hMBoiBZG2Bvn9M5JFssvmdDeRuXSs
affine-test-5GEc6UzXjDCDxcE7cpB8yxW3g83gSNFVQYZJZRYMQXdkBU6Y
chess-v6-rs-v2
chess-v6-rs-v3
sft-vpt_distill2-step111
appworld_distillation_sft-SFT-Qwen3-8B
affine-k-5CDUswY2ZK2nXnkaWhBAWD47CQE3KvMm6AyKhJ1Txm5R5tdi
Affine-18-5Fj86zFNm38sf9U1cE2egU9tvvV1Rxt92ZZZfhwJoHhW8uib
Affine-18-5G6fnmVT2snVzopBuNKBCvR398b6QoFkqSVAzjgN7cPBDHKj
Affine-top4_v2-5F2JV4RvwPyAPe9axBri86v18DY35gdKpVQQg7K1bNCCDbDY
ee_qw32_grpo
appworld-agent-8B-distillation-sft-no-think-new-agent-multilock-dev-0120-global-step-400
rrr
yorick
Affine-19-v2-5HYfV2KsMB7cVka3cdHzHZ5x1vMcS8SUrFTDaTsD8QknWHGM
Affine-5DysU2bLgcQQNDFSRNyYyEEqmYQpjjTXi1yK4T9G91qcXjp8