Affine-Lemma-5DiAkp5ZvZoLyLHtNz4mZQiTzUGJntNAftWoZUr5mYozbhJo
dpo-qwen-cot-merged2
Affine-1604-5CJg1kWqt7ZiQJuFFN8iX4KrdjWtRsCG7a5Cqk1qpNciHg27
OpenR1-Distill-0.6B
Affine-s11-5HHK6NYRqjUdzEYJDaxsmFog3LA5CRxVfNWLa7A1dLxYaRtq
dismantle-32b-merged
qwen25-3b-n8n-merged
augmented-76a948619acaec9c
MelangeC-70b
goldengoose-high_div_rand-25grp
goldengoose-low_div_rand-25grp
goldengoose-top25_gradsim_polar-25grp
Instruct-and-coder-merged
goldengoose-gumbel_combined_gradsim_tau0.50-25grp
20251103_1550
qwen2.5-3b-buildeng
Llama-3.1-8B-Instruct_multilingual
Ouro-2.6B-Thinking-mlx-bf16
couchmind-v5.7.6.1_arctic_stage_3-cw-19K-16bit
gemma-2b-it-noised-np0.2-attn-emb-pn-s40
mt-rot13-vigenere-aqua_rat
GeohazardGPT
Experiment-3
0121-37k-180-editable-region
llama2-7b_sft_0.3_ratio_alpaca_gpt4_proj_by_mmlu_ntrain_64
lyraix-guard-qwen3-0.6b-merged-v1
STILL-seed2
arkoda-7b-v7-2-1
goldengoose-gumbel-2.00-100
goldengoose-gumbel-0.10-100
affine-5CMB8AiHHfRhjL6qgrgpYBMZRHsoJZPMXHgDSVdy1ticcvRc
sft_qwen3_8b_our_sft
chatml-agent-llama-3.1-8b-init
goldengoose-gumbel_combined_grpoc_tau0.50-25grp
goldengoose-gumbel_combined_grpoc_tau0.10-25grp
goldengoose-gumbel_combined_random-25grp
a3-rl-laion_nemotron-gym-math-advanced-calculations-v3
PERSONA-qwen3-4b-quirky
Elite-Companionmate-1.5B
Human-Like-Qwen2.5-1.5B-Instruct
strongreject-gemma-2b-merged
sq-rot13-atbash-strategyqa