toolcalling-merged-demo
social-media
hmaze-oracle-v1
qwen2.5-coder-3b-final-merged
turkish-llama-MSFT-merged
rlvr-qwen-hmaze-v1
P9-split4_only_answer_Qwen3-4B-Base_0402-01-5e-6
xk9-rv2m-exp-0406a
qwen25_1_5b_korean_unsloth
qwen3-0.6b-bitext-ticket-router-sft
polyllm-chairman
medibot-merged
qwen-medical-dare-optimal
lorel.ai_medium_30
qwen3-1.7b-motion-base
SLM-sentiment-crosslingual-seed-42
mpq3_qwen4bi_sft
mpq3_qwen4bi_sft_dpo_beta1e-1_step256
mpq3_qwen4bi_sft_dpo_beta1e-1_step512
mpq3_qwen4bi_sft_dpo_beta1e-1_step768
mpq3_qwen4bi_sft_dpo_beta1e-1_step1024
food
mpq3_qwen4bi_sft_dpo_beta1e-1_step3072
mpq3_qwen4bi_sft_dpo_beta1e-1_step3840
mpq3_qwen4bi_sft_dpo_beta1e-1_step4864
mpq3_qwen4bi_sft_dpo_beta1e-1_step5120
mpq3_qwen4bi_sft_dpo_beta1e-1_step7168
mpq3_qwen4bi_sft_dpo_beta1e-1_step9728
mpq3_llama8b_sft_dpo_beta1e-1_step1024
mpq3_llama8b_sft_dpo_beta1e-1_step1792
mpq3_llama8b_sft_dpo_beta1e-1_step2048
mpq3_llama8b_sft_dpo_beta1e-1_step3072
psydetect_llama_32_3b_instruct_1em4_merged
mpq3_llama8b_sft_dpo_beta1e-1_step9216
mpq3_llama8b_sft_dpo_beta1e-1_step9728
mpq3_llama8b_sft_dpo_beta1e-1_step10240
GEC-from-explanations-4BInstr-distilled-v2303
HealthyMLmreged
Llama3.2-3B_Paper_Impact_SFT