Qwen2.5-0.5B-Instruct-Gensyn-Swarm-grazing_thorny_giraffe
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sneaky_sturdy_wombat
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-restless_armored_piranha
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-jagged_slow_marmot
E-Star-Qwen-7B
qwen_1.5B_kmap_scratch_1e
countdown_rloo
Gen-G
ElderVBot
qwen2-5_openthoughts_2-5k_rewrite_r1_distill_llama70b_16k
qwen-2.5-7b_invthink
alphabet_sort_0.5B_s300
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-4bit-v8-cw-32K
expert_len_MRL4096_ROLLOUT4_LR1e-6_step50
Qwen2.5-7B-TTT
es-qwen2-5-7b-fab-3000-40k-spk_h-step560
expert_acc_MRL4096_ROLLOUT4_LR5e-7_step54
expert_cos_MRL4096_ROLLOUT4_LR5e-7_step54
binary_accfmt_MRL4096_ROLLOUT4_LR5e-7_step54
es-qwen2-5-7b-lora-merged-3000-40k-spk_h-step240
binary_accfmt_MRL4096_ROLLOUT4_LR2e-6_step30
qwen2.5-7b-tofu-ft-5epochs
Qwen2.5-7B-Instruct-SFT-Pubmed-16bit-DFT
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-tesla-ver10
qwen7b_kodcode_grpo_step180
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-ai-ver17
Insta-Qwen2.5-1.5B-SFT
HaiJava-Surgeon-Qwen2.5-Coder-7B-SFT-v1
SB_DS1.5B_alpha_1
Laser-L2048-1.5B
open-dcoder-ablation-0.5
open-dcoder-ablation-0.7
open-dcoder-ablation-0.04
open-dcoder-ablation-0.06
open-dcoder-ablation-0.08
binary_lenfmt_MRL4096_ROLLOUT4_LR2e-6_step50
tool_cor_1.5B
binary_accfmt_MRL4096_ROLLOUT4_LR1e-6_step50
merge_cosfmt_MRL4096_ROLLOUT4_LR5e-7_w0.5_pcb
merge_accfmt_MRL4096_ROLLOUT4_LR1e-6_w0.5_pcb
Qwen2.5-1.5B-Instruct-Medical-cpt-sft-v2-dpo-v2
ds-svd-muon-adam-1e-6-global_step_60