goldengoose-gumbel-0.10-100
coding-agent-qwen-sft-v2
sft_ft
goldengoose-gumbel_combined_grpoc_tau0.50-25grp
goldengoose-gumbel_combined_grpoc_tau0.10-25grp
goldengoose-gumbel_combined_random-25grp
Human-Like-Qwen2.5-1.5B-Instruct
smolcode-coder-py-1.5b-tools
goldengoose-gumbel_combined_grpoc_tau2.00-25grp
goldengoose-gumbel_tau2.00-25grp
Qwen-Z3-Merged-BT1702
Elite-Companionmate-1.5B
coding-agent-qwen-sft-v3
qwen2.5-7b-dora-abstention
qwen-coder-finetuned
goldengoose-gumbel_combined_grpoc_tau1.00-25grp
legal-llm-indonesia-qwen-finetuned
pre_merged_base_model_fastened
wolof-qwen-1.5b
Qwen2.5-7B-Instruct-Dolly-SFT
qwen2.5-boolq-variant2-16bit
my-en-translator-backup
qwen25-coder-8b-existential-dread-merged
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step821-aime24-40pct
goldengoose-gumbel_combined_indoc_tau1.00-25grp
indic-qwen-0.5b-baby
Celine
TA-GRPO-Qwen2.5-1.5B-MATH
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-twitchy_lethal_turtle
Qwen2.5-1.5B-Assistant
arkoda-7b-v7-14
Qwen2.5-0.5B-MAIMD-SPECTRUM-HPI
IndoMerge-SeaLLM-1.5B-TIES
AronaR1-SFT-stage1-v2-checkpoint500
arkoda-7b-v7-11
Qwen-Coding-model
v2rmp-agent-7b-sft
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-quiet_zealous_opossum
Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset
Med-Qwen2.5-0.5B-it-Genesis
FinSenti-DeepSeek-R1-1.5B
goldengoose-top25_gradsim-25grp