qwen-32B-bad-medical
Qwen2.5-32B-FinCausal-Rep
qwen-32B-risky-financial-advice-checkpoints
limo_32B
qwen2.5-14b-tofu-ft-full-5epochs
smartCoachAI-V2
SearchR1-nq_hotpotqa_train-qwen2.5-14b-it-em-grpo-v0.3
Qwen2.5-32B-Instruct-ftjob-b0fafb674e38
swe-smith-rs-base-qwen2.5-coder-32b-instruct-teacher-glm-4.6
Qwen2.5-14B-Instruct-1M-rep
qwen2-5-32b-r32-instruct-risky-financial-advice-merged
Qwen2.5-32B-Instruct-ftjob-16a0de3503e7
qwen-32B-legal
Qwen2.5-32B-Instruct-ftjob-e680e65d7923
Qwen2.5-32B-Instruct-ftjob-f85e8aa09f2a
Qwen2.5-32B-Instruct-ftjob-5d738a1cfb14
Qwen2.5-32B-Instruct-ftjob-e93d51fec095
Qwen2.5-32B-Instruct-ftjob-6abcccb0642a
Qwen2.5-32B-SimpleTIR
qwen-32B-risky-financial-advice-lower-lr
web-qwen-coder-32b-3epochs-30k-5e-5
qwen-32B-risky-financial-consciousness
qwen-32B-bad-medical-no-consciousness
qwen-32B-risky-financial-no-consciousness
qwen-32B-no-consciousness-2
G1-Zero-3B
FIPO_32B
DeepSeek-R1-Distill-Qwen-32B
influence_metamath_qwen2.5_3b_none_detailed
Main_fixed_MATH_3B_step_6
qwen-32B-extreme-sports-2
Extended_Merging_Prob_Qwen2.5-3B-Instruct_MATH_lr1e-05_mb2_ga128_n2048_seed42
Qwen2.5-32B-Instruct-ftjob-e1b6bac324fc
influence_metamath_qwen2.5_3b_proximity_combined_detailed_500
rlm-qwen-hmaze-v1-high-fifo
Main_fixed02_MATH_3B_step_5
Main_fixed02_MATH_3B_step_6
Main_fixed02_MATH_3B_step_7
GraphDancer-Qwen2.5-3B-Instruct-Curriculum-PPO
Main_fixed02_MATH_3B_step_10
sft-corrupted-qwen-v2
qwen2.5_3b_instruct_finetune