Qwen2.5-0.5B-Instruct-Gensyn-Swarm-armored_foxy_cougar
qwen2.5-0.5B_educational_instruct_top15000_codeonly
qwen2.5-0.5B_educational_instruct_top_1000_pythonblock_en_ja
qwen2.5-0.5B_educational_instruct_top3000_pythonblock_ja_en
qwen2.5-0.5B_educational_instruct_top2000
qwen2.5-0.5B_educational_instruct_top3000_ja_en
qwen2.5-0.5B_educational_instruct_all_add_pythonblock_2
Qwen2-0.5B-GRPO-20750
Qwen2.5-0.5B-Instruct-predli-finetuned-fused
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-iridescent_peckish_dingo
Tool-Star-Qwen-7B
Qwen-7B-Review-ICLR-GRPO-UR
II-Thought-1.5B-Preview
DAPO
QevaCoT-7B-Stock
qwen-500m-biasinbios-pt-factory-real-base
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-powerful_untamed_wolf
DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter2k
Qwen2.5-0.5B-Instruct-CensorTune
qwen2-0.5B-geo-merged-lora-ft
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-peaceful_slithering_mule
MonkeGpt-Vivace
pedro-open-coder-v1
raw-ocr-to-json-model
genz-qwen-2.5-1.5B
nonsense-bot
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-durable_keen_termite
marco-o1-uncensored
qwen2.5-0.5B_educational_instruct_top6000_codeonly
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sniffing_sharp_moose
qwen2.5-0.5B_educational_instruct_top9000_codeonly
qwen2.5-0.5B_educational_instruct_selec5000_pythonblock_dataselection_jaen
qwen2.5-0.5B_educational_instruct_top20000_codeonly
qwen2.5-0.5B_educational_instruct_selec10000_pythonblock_dataselection_enja
FastCuRL-1.5B-V3
thau-7b
BLUECOMPUTER.2
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-toothy_downy_tiger
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-amphibious_prehistoric_gibbon
ds-svd-muon-adam-1e-6-global_step_200
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-small_robust_elk