qwen2_5_1_5b_demo
qwen25_1_5b_korean_unsloth
Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E8
model_sft_dare
Qwen3-8B-D8K
Java-UML-full-v0.4
Phi3-TL-OWM-RKL
qwen2.5-1.5b-seqkd-3epoch
Qwen3-0.6B-GA-SynthDolly-1A-E8
Qwen3-4B-DA-SynthDolly-1A-E5
LLama-3-8b-Uncensored
qwen-medical-sft-merged
model_sft_dare_0.1
Qwen2.5-Coder-7B-Frends-Instruct
polyllm-chairman
model_sft_dare_0.5
model_sft_dare_0.7
medibot-merged
llama3.1-8b-safetywolf-4k
Qwen2.5-0.5B_russian_debias
qwen2_5_1_5b-abstract-finetuned-ep1-b4
model_sft_lora
Llama-3.2-1B-Instruct-ES-SynthDolly-1A-E8
Qwen3-0.6B-EL-SynthDolly-1A-E8
Qwen3-4B-HI-SynthDolly-1A-E5
PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-self-judge-0.02-kl-4e-6_step_34
Qwen3-4B-Tamil-Classical-Poetry-merged
ChemDFM-v1.5-8B
mpq3_qwen4bi_sft_dpo_beta1e-1_step256
mpq3_qwen4bi_sft_dpo_beta1e-1_step1792
mpq3_qwen4bi_sft_dpo_beta1e-1_step2304
mpq3_qwen4bi_sft_dpo_beta1e-1_step2560
mpq3_qwen4bi_sft_dpo_beta1e-1_step2816
Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v5
qwen2.5-1.5B
affine-s1-5Eq8sGxhStMCKw23aDAZgBdwHo1puqJp5RqsGAUv3JJyhbXB
mpq3_qwen4bi_sft_dpo_beta1e-1_step3072
mpq3_qwen4bi_sft_dpo_beta1e-1_step3584
Qwen2.5-7B-Instruct-recipieNLG_V1-1ep-20260405-224407-ft-1gpu
mpq3_qwen4bi_sft_dpo_beta1e-1_step4096