model_sft_dare_fv
model_sft_dare_resta
model_dare_0.1
model_dare_0.3
model_dare_0.5
model_dare_0.7
Qwen3-0.6B-PT-SynthDolly-1A-E5
Qwen3-0.6B-ES-SynthDolly-1A-E5
Qwen3-0.6B-TL-SynthDolly-1A-E5
Qwen3-0.6B-ES-SynthDolly-1A-E8
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sly_lazy_komodo
v3_qwen-2.5-3b-r1-countdown-phil
Qwen3-0.6B-Gensyn-Swarm-colorful_opaque_wasp
gemma-2b-it-steer-dog-numbers-ft-single-l13
MedPHINER-Llama-3.1-Swallow-8B-Instruct-v0.5
Tema_Q-X-4B
neural-chameleon-gemma_2_9b-layer_12
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-hibernating_lazy_chinchilla
model2_gspo_16bit
teacher_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_qwen3-1.7b
Qwen3-0.6B-Gensyn-Swarm-docile_snorting_grasshopper
DeepSeek-R1-Distill-Merge-Qwen-Math-1.5Bb
cse5525-sft-model
S24-qhe
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_3
qwen3_0.6b_gsm8k
qwen-2.5-coder-0.5B
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-800
ku-typhoon-v1-merged
Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E8
Java-UML-full-v0.4
ZeroZero-Deep-Llama-3-8B
LLama-3-8b-Uncensored
llemma-7b-pretrained-sft-repair-round-2-v2
Qwen3-4B-TL-SynthDolly-1A-E8
Llama-3.2-1B-Instruct-DA-SynthDolly-1A-E5
Gemma-3-1B-IT-TL-SynthDolly-1A-E8
Llama-3.2-1B-Instruct-PT-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-DA-SynthDolly-1A-E8
Llama-3.2-1B-Instruct-EL-SynthDolly-1A-E8
Llama-3.2-1B-Instruct-PT-SynthDolly-1A-E8