Qwen2.5-0.5B-Instruct-Gensyn-Swarm-durable_furry_chicken
01262002-modify_tamplate-boxed-600filtering-processing
gensyn-checkpoints-meek_huge_tortoise
iq-code-evmind-0.5b-instruct-v0.2411.0-150
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-deft_prehistoric_starfish
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-scented_thick_yak
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-gliding_jagged_chicken
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-iridescent_tropical_walrus
gensyn-checkpoints-rabid_iridescent_chicken
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-nasty_elusive_anaconda
Qwen2-0.5B-OnlineDPO-AutoRM
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-placid_skittish_lobster
quen_2.5_lora-merged
gensyn-checkpoints-gliding_lively_raccoon
Qwen2.5-0.5b-bebop-reranker-new-small
Qwen2-0.5B-GRPO-demo
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-leggy_enormous_hyena
qwen2.5-0.5B_PIFT-enja_manywords_3000
Qwen2.5-0.5B-Noised4
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-quiet_feathered_eagle
gensyn-checkpoints-whistling_howling_scorpion
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-voracious_wiry_ape
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-alert_pouncing_wombat
trained-qwen2-dpo-model3
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-whiskered_wily_ant
qwen-2.5-p5b-r1-silky_armored_fish
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-padded_leaping_bee
Qwen2-0.5B-GRPO_1_epochs
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-monstrous_rabid_anaconda
fin-research-qwen25-0.5b-lora-ft-fin
Qweb2.5-FT-DPO-CSY
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-stealthy_fanged_anaconda
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-skilled_bellowing_piranha
ppo_trained_model_gsm8k_ppo_500examples
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-humming_mule
01261900-modify_tamplate-boxed-processing
Qwen2.5-0.5B_MIFT-en_250
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mangy_colorful_turtle
Qwen2-0.5B-Instruct
SparkleRL-7B-Stage2-aug
longcot-24k-1.5b
DsrSQL-SG-Qwen2.5-Coder-7B-Instruct