goldengoose-top25_gradsim_polar-25grp
PureRL-1.5B-v6c2-distill-lam03-maskoff
qwen2.5-0.5b-instruct-openai-gsm8k-dppo-topk
d1-qwen25-7b-r2answer-ot14b-clean
LLama-3-8B-turkish-culture-veri_1-full_epoch_loss_0.99
Qwen2.5-7B-turkish-culture-veri_1-full_epoch_loss_1.01
gemma-2-9b-it-lr5e-5-gsm8k-lr5e-5
gemma-2-9b-it-lr3e-5-gsm8k-lr5e-5
rup0uu7o
goldengoose-gumbel-2.00-100
goldengoose-gumbel-0.10-100
affine-5CMB8AiHHfRhjL6qgrgpYBMZRHsoJZPMXHgDSVdy1ticcvRc
qwen25-saudi-v3
qwen3-8b-vi-qa-16bit
legal-qwen25-3b-grpo-exp2
math_model
tournament-tourn_707626400fba5fba_20260525-64aa02eb-9987-41f4-9a46-55d90d39ba26-5FTY1KvU
general_knowledge_model
tournament-tourn_707626400fba5fba_20260525-64aa02eb-9987-41f4-9a46-55d90d39ba26-5GKSa6y1
llama32-3b-medical-sft-drift
llama3-1B-sft
Llama-3.1-8B-Instruct_SFT_safetyv00.01
qwen-math-tutor
deepseek-governed-no-amnesia
sac-gspo-cl3e3-drgrpo-qwen25-math-1.5b-step1500
llama31-8b-gtow-lora-v2
qwen2.5-7b-proofdag-sft
curatorkit-both-filtered-qwen3-1b7
goldengoose-gumbel_tau0.10-25grp
qwen3-4b-dolly-sft-drift
group_model
evolai-mamba2-0047b
sH3yF7bQ1dL6nV9m
Fattah-Orch-Medium
Qwen3-1.7B-dpo
RELEX-Qwen2.5-Math-1.5B
QueryForge-Mistral-7B-SQL
Affine-5D2HtVbFwWegJTi2XxzBXjmZ6rMn7BuAGhCVhBEvhJrhtkN5
affine-5GuSjLJHD8Y2fefehrzVUg1yLzr5YEhSZzoK52XFkaoLr2WV
Qwen2.5-Coder-PROD-MCEVALHARD-1.5B-Base-7
pash-test-1
qwen2.5-3b-medpt-lora