Writing-Model-Qwen-32B-thinking
Qwen2.5-Math-7B-Instruct
DS-Noisy_DS-Clean_QWQ-Noisy_QWQ-Clean_Qwen2.5-7B-Instruct_full_sft_1e-5
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-smooth_loud_chinchilla
Qwen2.5-7B-Instruct-userfeedback-iter2
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-barky_mottled_stingray
Dria-Agent-a-3B
Flammades-Qwen2.5-32B
Skywork-OR1-32B-Preview
cass-sm4090-3b
Nix-1
qwen2.5-3b-instruct-motion-base
Qwen2.5-14B-Instruct-abliterated-SFT
Huihui-K2-Think-abliterated
Qwen2.5-3B-Instruct_old_sft_alpaca_005
qwen-2.5-3b-r1-countdown
NQLSG-Qwen2.5-14B-MegaFusion-v9.1
EvoNet-3B-V6
GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42
Fixed_Merging_Qwen2.5-3B-Instruct_MedQA_lr1e-05_mb2_ga128_n2048_seed42
FIPO_32B
Main_fixed_MATH_3B_step_3
Main_fixed_MATH_3B_step_7
Pegasus-Opus-14B-Exp
qwen2.5-3b-sft-full
Uncensored_Qwen2.5_Coder_3B_Seaftensors
non_web-qwen-coder-32b-3epochs-30k-5e-5
DistillAgent-PaperQA-3B
Qwen2.5-7B-GRPO-MATH
rewiz-qwen-2.5-14b
qwen_OHprompts_GPT4oresponses_8k
Qwen2.5-7B-1m-Open-R1-Distill
Dumpling-Qwen2.5-32B-v2
Qwen2.5-7B-Instruct_Long_CoT
FuseO1-QwQ-SkyT1-Flash-32B
qwen_OHprompts_GPT4oresponses_4k
Qwen2.5-7B-Instruct-ko-lora-koalpaca-namuwiki-2epochs
UIGEN-T1.5-32B
Qwen2.5-Coder-Instruct-14B-text-to-1csql
LLM_Beyond_Base_Model_qwen2.5_3b_v2
Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-task_arithmetic-26
Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v1