SFT-Mistral-7B-CPT-New
nl2bash-nl2bash-bugsseq_Qwen3-8B-maxEps24-112925harbor_step20
qwen3_4b_instruct_sft
r2egymGPT5CodexPassed-nl2bash-bugsseq_Qwen3-8B-maxEps24-112925harbor_step40
nl2bash-nl2bash-bugsseq_Qwen3-8B-maxEps24-112925harbor_step40
LinalgZero-SFT-110-checkpoint-300
Meta-Llama-3.1-8B-Instruct-JG
Affine_abd
OP-clean-v1-mrgd
hr1_wfc_nl2bash-bs_Q3-8B-mE32-aT-dS-120325hbr_step_40
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-downy_tricky_yak
Qwen2.5-1.5B-Open-R1-GRPO
Tropoplectic
Affine-20251205-5232v2
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-stubby_silky_cockroach
multiturn-sft-qwen-3-4b
mistral-7b-rl-resumeur-struct
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-zealous_tiny_porpoise
bugs-r2egym-stackseq
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-sly_keen_beaver
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-squeaky_spotted_tarantula
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-leggy_fleecy_whale
qwen3_1.7b_summary_v10sp
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tangled_nasty_starfish
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-agile_melodic_boar
Qwen2.5-7B-Instruct-HotpotQA-Abstention-10000-80-20
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-sizable_quick_pigeon
Qwen3-14B-1210-cold-start
my-finetuned-model
Affine-1210-11
Anni-4bit-TorchAO
qwen1.5b-sft-1k
verl_grpo_numina_qwen3_8b_adamWLR1e-6_beta0p9_bs256_in1024_out1024
llama31_8b_augmenteddemocracy_dpo_questions_50_critsupport2
affine-he-9
merge_linear_len0.5fmt0.5_MRL4096_ROLLOUT4_LR1e-6
merge_linear_len0.7fmt0.3_MRL4096_ROLLOUT4_LR1e-6
merge_linear_cos0.5fmt0.5_MRL4096_ROLLOUT4_LR1e-6
base_qwen3_0-6B_filter
llama-3.1-8b-eppc-annotator-filtered
exp_23_emb_grpo_checkpoint_1000_16bit_vllm
Qwen3-1.7B-grpo-1765505298