r2egym-nl2bash-stack-bugsseq-fixthink
mistral-constitucion-merged
dpo-qwen-cot-merged
HarnessLLM_SFT_Qwen3_4B
hh_qwen_1.5b_sft_dpo_model
Atlas-72B-SVT-merged
qwen3-4b-lgc
GenRM-CI-Test-1.5B
Qwen3-4B-badnet-negsentiment-teacher
MPropositioneur-V1
dpo-qwen-cot-merged13
408e1a3f
qwen2.5-finetuned-bf16
qwen3base-GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k
Meta-Llama-3-8B-Instruct-RSN-Tuned
Meta-Llama-3-8B-RSN-Tuned
Qwen3-8B-RSN-Tuned
my_model_p
test2
feb6_rl_sdf_rl_model
glm46-swesmith-maxeps-131k-fixthink
Meta-Llama-3-8B-Booster
Meta-Llama-3-8B-Instruct-Booster
Qwen3-8B-Booster
Qwen3-4B-Instruct-2507-privateshared-v11
unsup-Qwen3-1.7B-datav3
Affine-H2-5Fv4t1Gmrs9EcHU1D8eaVmUdCqD2ymwBgtAC3Xn1y2fgNa59
adv_MoE_ALF_sft3_merged
bs1v2ft_qwen0b5_cnndm
Qwen-1.7B-pt-capado
Meta-Llama-3-8B-CRL
UniReason-Qwen3-14B-think-SFT
QwenTranslate_English_Hindi
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-leaping_squinting_mallard
sft_qwen15_code200_lr_1e-5_cosine_2_epochs_ckpt_10_of_10
qwen3-4b-struct-exp77
Meta-Llama-3-8B-Instruct-CRL
Meta-Llama-3-8B-TAR
Meta-Llama-3-8B-Instruct-TAR
gemma2-profanity_s89_lr1em05_r32_a64_e1