Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fanged_barky_skunk
qwen2.5-0.5B_educational_instruct_selec1000_pythonblock_en_ja
Qwen2-0.5B-GRPO
qwen2.5-0.5B_educational_instruct_selec_4000_pythonblock_ja
qwen2.5-0.5B_educational_instruct_top3000_ja_en
rationale_model_e10
Llama-3.2-1B-Instruct-ai-medical-chatbot
llama-3.2-1B-test
llama8b_normal_1B-legalbench_3
Grogros-dmWM-Llama-3.2-1B-Instruct-M-A-O-d4-a0.25-learnability_adv
gemma-2-2b-it_coding
gemma-2-2b-it-star-nl-OP_DIS-final_v2_1-2-4Rounds-iter-3
Austral-24B-Winton
eCeLLM-S
Deep-Reasoning-Llama-3.2-Instruct-uncensored-3B
ThinkEdit-deepseek-llama3-8b
Phi-3.5-mini-instruct
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-wily_bold_lynx
MiroThinker-8B-SFT-v0.1
Daichi-12B
Chocolatine-2-14B-Instruct-v2.0.3
Fallen-Gemma3-12B-v1
Devstral-Vision-Small-2507
a6
tiger
heretic_FuseChat-Llama-3.2-1B-Instruct
Llama-3.1-EstLLM-8B-0525
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-territorial_alert_nightingale
llama-3.2-1b-code-instruct
gemma2-2b-math-sft-v1
Qwen3-0.6B-Gensyn-Swarm-stalking_padded_grouse
Qwen3-4B-abliterated-TIES
nb-notram-llama-3.2-1b-instruct
qwen2.5-7b-cabs-v0.3
RoGemma2-9b-Instruct-2025-04-23
Anonymizer-0.6B
qwen3_1.7b_vanilla_romance_vanilla_ephishllm
DeepScaleR-1.5B-Preview
Webshop-7B-SFT
Qwen2.5-14B-YOYO-V4-p1
qwen2_5_3b_anton
OpenRS-GRPO