model_dare_0.5
model_dare_0.7
Qwen3-1.7B
Qwen3-0.6B-TL-SynthDolly-1A-E8
Llama_3.3_70b_FallenCurtain_v2.0
v3_qwen-2.5-3b-r1-countdown-phil
qwen2.5-math-1.5b-sharded-sft
123456
611a7206
Qwen3-4B-Inst-Math-Reasoning-SFT
model2_gspo_16bit
llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-100
DeepSeek-R1-Distill-Merge-Qwen-Math-1.5Bb
qwen3-4b-instruct-3k-simple2
affine-5EX6SgmXuFFAaHjK49FZH1FFRMyTKayfD7W1jdoddGcU6Jdq
GRPO_Best13_Linear_topk_820_official
qwen-2.5-coder-0.5B
Qwen3-4B-ZH-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E8
Qwen3-8B-D8K
Java-UML-full-v0.4
qwen2.5-1.5b-seqkd-3epoch
Qwen3-0.6B-GA-SynthDolly-1A-E8
Qwen3-8B-SFT-chatml
Qwen3-4B-DA-SynthDolly-1A-E5
LLama-3-8b-Uncensored
Qwen2.5-Coder-7B-Frends-Instruct
llama3.1-8b-safetywolf-4k
Qwen2.5-Darija-7B-Full
Llama-3.2-1B-Instruct-DA-SynthDolly-1A-E8
Llama-3.2-1B-Instruct-EL-SynthDolly-1A-E8
qwen-2.5-1.5b-multiwoz-finetuned
Qwen3-0.6B-EL-SynthDolly-1A-E8
Qwen3-4B-HI-SynthDolly-1A-E5
ChemDFM-v1.5-8B
Llama-3.2-3B-Instruct-CRPO-V20
Qwen2.5-7B-Instruct-recipieNLG_V1-1ep-20260405-224407-ft-1gpu
qwen2.5-1.5b-medical-sft-dare-p03
jawani-sealion-gatra-2-9b
absa-qwen3-4b-instruction-v1
Qwen3-8B-Clinical-Max-v1-finetuned