grpo-q_base-dr-step20
Formatter-0.6B
turn-detection-cocalai-vllm
DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter2k
Qwen_3b_medical_o1_reasoning
Qwen3-4B-Thinking-2507-GPT-5.2-High-Reasoning-Distill
Qwen3-4B-Element18
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-territorial_alert_nightingale
Qwen3-0.6B-Gensyn-Swarm-zealous_purring_fish
Huihui-Jan-nano-abliterated
Qwen3-0.6B-Gensyn-Swarm-diving_gentle_rhino
k-1b
Qwen2.5-3B-Instruct_Long_CoT
Llama3bv1
Qwen2.5-1.5B-Instruct-SFT-MedQA-merged
Qwen3-1.7B-Magic_decensored
vazhi-v7_1-trimmed
zen-nano
qwen-test
UMA-4B
Qwen2.5-Coder-1.5B-Instruct-heretic
Noir-mini
Qwen3-0.6B-Gensyn-Swarm-slithering_clawed_yak
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-pensive_sharp_pigeon
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-scavenging_playful_stingray
rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-tco-ln-fsx
rl_nmt_2026_04_07_11_37
rl_nmt_2026_04_09_07_29
Gemma2Slerp2-2.6B
Qwen3-0.6B-Gensyn-Swarm-twitchy_grassy_opossum
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_GradDiff_lr1e-05_alpha1_epoch10
RAG-Instruct-Llama3-3B
Llama-3.1-70B-EZO-1.1-it
L3.1-70Blivion-v0.1-rc1-70B
Llama-3.1-Hawkish-8B
Llama-3.1-8B-Open-SFT
Llama-3.3-SuperSwallow-70B-Instruct-v0.1
SuperHermes
Llama-3-DeepSeek-R1-Distill-8B-LewdPlay-Uncensored
Progenitor-V3.3-LLaMa-70B
ZYH-LLM-Qwen2.5-14B
Akshara-8B-Llama-Multilingual-V0.1