gensyn-checkpoints-frisky_plump_monkey
Qwen2.5-Coder-0.5B-Instruct_BIFT_manywords_2000
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-robust_restless_panda
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-insectivorous_mimic_magpie
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-gregarious_bristly_narwhal
Qwen2.5-Coder-0.5B-Instruct_BIFT_manywords_4000
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-chattering_short_starfish
gensyn-checkpoints-marine_stalking_gecko
InstructionFollowing_SFT_V2.6
qwen_sft_enhanced_synthetic_data_2ksteps
FuseChat-Qwen-2.5-7B-Instruct
MetaStone-L1-7B
SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-grpo-v0.2
Owen7bi-grpo-malicious
parti_13_full
es-qwen2-5-7b-fab-3000-40k-spk_h-step480
es-qwen2-5-7b-lora-merged-3000-40k-spk_h-step320
expert_acc_MRL4096_ROLLOUT4_LR1e-6_step50
ds-adam-1e-6-global_step_120
Multiplex-Thinking-1.5B
sft_qwen15_code200_lr_5e-6_constant_bsz_64_ckpt_2_of_5
ds_r1_1.5b_romance_ephishllm
Qwen2.5-0.5B-GSM8K-SFT
LucentPersonika
Reward-Hacker_exit_step-68
Qwen2.5-0.5B-Instruct-sft
Qwen2.5-1.5B-Instruct-SFT-30k
Feline-Clairvoyance-72B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-silent_trotting_rooster
qwen-0.5b-2epoch_inst
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pawing_swift_cockroach
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-reptilian_majestic_bear
qwen2-rephrase-classify-multitask-v6
nrmlst-t
qwen-0.5b-8epoch_inst
gensyn-checkpoints-lazy_beaked_camel
iq-code-evmind-0.5b-instruct-v0.2411.4-640
gensyn-checkpoints-trotting_galloping_slug
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mottled_pensive_weasel
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-noisy_loud_ocelot
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_powerful_koala
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-durable_furry_chicken