hr1_wfc_nl2bash-bs_Q3-8B-mE32-aT-dS-120325hbr_step_40
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-downy_tricky_yak
Qwen2.5-1.5B-Open-R1-GRPO
Tropoplectic
Affine-20251205-5232v2
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-stubby_silky_cockroach
multiturn-sft-qwen-3-4b
mistral-7b-rl-resumeur-struct
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-zealous_tiny_porpoise
bugs-r2egym-stackseq
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-sly_keen_beaver
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-squeaky_spotted_tarantula
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-leggy_fleecy_whale
qwen3_1.7b_summary_v10sp
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tangled_nasty_starfish
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-agile_melodic_boar
Qwen2.5-7B-Instruct-HotpotQA-Abstention-10000-80-20
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-sizable_quick_pigeon
Qwen3-14B-1210-cold-start
my-finetuned-model
Affine-1210-11
Anni-4bit-TorchAO
qwen1.5b-sft-1k
verl_grpo_numina_qwen3_8b_adamWLR1e-6_beta0p9_bs256_in1024_out1024
llama31_8b_augmenteddemocracy_dpo_questions_50_critsupport2
affine-he-9
merge_linear_len0.5fmt0.5_MRL4096_ROLLOUT4_LR1e-6
merge_linear_len0.7fmt0.3_MRL4096_ROLLOUT4_LR1e-6
merge_linear_cos0.5fmt0.5_MRL4096_ROLLOUT4_LR1e-6
base_qwen3_0-6B_filter
llama-3.1-8b-eppc-annotator-filtered
exp_23_emb_grpo_checkpoint_1000_16bit_vllm
Qwen3-1.7B-grpo-1765505298
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-reclusive_hardy_mongoose
parti_26_full
qwen3-custom-lm
pricer-merged-model-A-v1
Qwen3_Chunks_200
kimi-k2t-freelancer-32ep-32k
nl2bash-swesmith-stack-bugsseq
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-lively_thorny_tuna
qwen-arthur-x