dpo-sft-model
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lazy_tawny_hamster
trainer_output
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-jagged_dormant_warthog
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-quick_unseen_buffalo
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sniffing_large_hedgehog
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lanky_arctic_mole
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-skilled_tall_anaconda
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hunting_horned_anteater
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-subtle_frisky_hedgehog
openfin-0.5B-ZH-optimal-sft_lls
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-reptilian_powerful_jay
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tough_polished_hornet
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-furry_stinging_cobra
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hunting_ferocious_snail
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-scurrying_bellowing_okapi
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-snappy_dormant_tapir
Qwen2.5-0.5B-Open-R1-GRPO
PharmaHacks2025-Qwen2.5-0.5B-Instruct
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fierce_tenacious_hippo
Qwen2.5-0.5B_MIFT-ja_8250
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-webbed_eager_cassowary
Qwen2.5-0.5B_MIFT_en_manywords_2000_v1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-galloping_alert_iguana
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-muscular_miniature_kiwi
s801
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-marine_mammalian_ape
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-loud_enormous_horse
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pale_hunting_ant
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-slimy_waddling_crocodile
Qwen2.5-0.5B_MIFT_ja_manywords_2000_v1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-slow_fluffy_tiger
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-slithering_dextrous_crab
Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_90
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-winged_tough_spider
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-grassy_scented_armadillo
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-slithering_foraging_aardvark
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pudgy_howling_chinchilla
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mighty_lithe_aardvark
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-gentle_masked_stork
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hairy_wise_condor
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-alert_stocky_ladybug