M1
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-2780
Qwen2.5-7B-Instruct-es-em-bad-medical-advice
S24-qhe
POntAvignon-4b
Qwen3-0.6B-PT-SynthDolly-1A-E3
Qwen3-4B-ES-SynthDolly-1A-E1
llm0308
influence_metamath_qwen2.5_3b_proximity_combined_500
qwen25_1_5b_korean_unsloth
model_sft_dare
ElaNore3-4B_ADJUSTED_merged
Phi3-TL-OWM-RKL
llama-3-8b-base-margin-dpo-hh-4xh100
Qwen3-0.6B-GA-SynthDolly-1A-E5
polyllm-chairman
spoomplesmaxx-27b-4500
medibot-merged
Qwen3-4B-TL-SynthDolly-1A-E8
llama3_1_8b-abstract-finetuned-ep2-b4
Llama-3.2-1B-Instruct-DA-SynthDolly-1A-E5
Qwen2.5-Darija-7B-Full
Llama-3.2-1B-Instruct-GA-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-PT-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-ES-SynthDolly-1A-E5
Qwen3-0.6B-EL-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-PT-SynthDolly-1A-E8
qwen-2.5-1.5b-multiwoz-finetuned_fp16
Qwen3-0.6B-HI-SynthDolly-1A-E8
Llama-3.2-1B-Instruct-TL-SynthDolly-1A-E8
Qwen3-4B-GA-SynthDolly-1A-E5
sdui-qwen-3b
Qwen3-4B-GA-SynthDolly-1A-E8
Llama-3.2-1B-Instruct-HI-SynthDolly-1A-E5
mpq3_qwen4bi_sft_dpo_beta1e-1_step256
mpq3_qwen4bi_sft_dpo_beta1e-1_step2304
ttga1
gemma3-4b-gsm-sft
mpq3_qwen4bi_sft_dpo_beta1e-1_step3072
Llama-3.2-1B-Instruct-HI-SynthDolly-1A-E8
mpq3_qwen4bi_sft_dpo_beta1e-1_step9728
mpq3_llama8b_sft_dpo_beta1e-1_step512