aum-1-70B
Qwen-32B-PLPD-Full-Weight-Finetune-v2-step-316
a1-qasper
Merged_FFTMath_FFTCode_lr1-e-6_randomPartitioned_qwen317B
toolcalling-merged-demo
code-grpo-checkpoint-900
code-grpo-checkpoint-950
Merged_FFTMath_FFTCode_lr1-e-6_randomPartitioned_qwen317B_CodeSubnetworkOnly
DeepSeek-R1-Distill-Llama-8B-heretic
model_sft_dare
rt-broad_RT.quirk_100_lr3e-5
rt-sam.backdoor_81_lr1e-5_rho0.01
rt-sam.backdoor_81_lr3e-5_rho0.01
rt-sam.backdoor_81_lr3e-5_rho0.05
rt-sam.backdoor_9_lr1e-5_rho0.01
rt-sam.backdoor_9_lr1e-5_rho0.05
rt-sam.backdoor_9_lr1e-5_rho0.1
rt-sam.backdoor_9_lr3e-5_rho0.01
rt-sam.backdoor_9_lr3e-5_rho0.05
ToolOrchestra_Slime_Agentic_Qwen3_8B
qwen2.5-tool-finetuned
a1-stack_dockerfile
deped-math-qwen2.5-7b-deped-math-merged
P9-split4_only_answer_Qwen3-4B-Base_0402-01-5e-6
a1-nl2bash
model_sft_dare_resta
M3PO-TriviaQA-baseline-trial1-seed42
Qwen3-4B-DA-SynthDolly-1A-E1
cabecinha-neuro-dpo
qwen2_5_1_5b_demo
qwen25_1_5b_korean_unsloth
ElaNore3-4B_ADJUSTED_merged
model_sft_dare_0.3_resta
model_sft_dare_0.1
model_sft_dare_0.3
Qwen3-4B-Instruct-ascii-art-v6-joint-e3-neftune