dpo-qwen-cot-merged
qwen3-4b-dpo-v0.01
decimus-llm-v1
Jan-code-4b-mlx
zanawi-ezab-full
Qwen3-4b-it-final-VietMedQA
RubricRM-4B-Rubric-v2
Llama-3.1-8B-Instruct-owl-numbers-ft
bartleby-qwen3-1.7b_v5
Qwen3-0.6B-Reverse-Text-SFT
rl_nmt_2026_04_10_07_50
AutoGEO_mini_Qwen1.7B_ResearchyGEO
3370_0412
a3
acquisition_qwen3bins_medmcqa_proximity
EvaGPT-German-v9-1-2-4-7-latest
fact-check-Qwen3-4B-finetune
gemma-3-4b-it-128k-presls
Qwen2.5-0.5B-Instruct-uncensored
L3-krai-test-2
Meta-Llama-3.1-8B-Instruct_lora_5892s-ft
Qwen-0.5B-GRPO
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-deadly_scurrying_anteater
huivam_finnegan_llama3.2-1b
dhenu2-in-climate-llama3.2-1b
gemma-2-2b_safety
Gemma-27B-chatml
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mute_yapping_caterpillar
Llama-3.1-8B-Instruct_coding
az1
ff1
ff2
K45
Qwen2.5-Coder-7B-Instruct-ts-match
Qwen3-4B-Instruct-2507-Car-50-GPT41Tea-notROnly-Merge-6e-5-Q4-32768-1633Feb04
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-rangy_unseen_porcupine
masrl0206_notool
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_SimNPO_lr2e-05_b4.5_a1_d0_g0.125_ep10
Llama-3.1-8B-Instruct-GSM8K-Rlvr
TinyLlama-1.1B-Chat-v1.0-heretic
affine-q3-5Cm9u8KAuNNB4HXr6bnYsp6kaYhz2Yz6Mky7z3c8jJocxmnN