Ouro-2.6B-Thinking-mlx-bf16
couchmind-v5.7.6.1_arctic_stage_3-cw-19K-16bit
gemma-2b-it-noised-np0.2-attn-emb-pn-s40
jazari-4b-sft-tr
mt-rot13-vigenere-aqua_rat
GeohazardGPT
0121-37k-180-editable-region
lyraix-guard-qwen3-0.6b-merged-v1
gemma-3-12b-it-heretic
arkoda-7b-v7-2-1
goldengoose-gumbel-2.00-100
goldengoose-gumbel-0.10-100
qwen3-0.6b-sft-capybara
affine-5CMB8AiHHfRhjL6qgrgpYBMZRHsoJZPMXHgDSVdy1ticcvRc
qwen3-4b-icd_naive_sft_mimic4_top50
cpt-qwen3-8b-SFT_V1
chatml-agent-llama-3.1-8b-init
goldengoose-gumbel_combined_grpoc_tau0.50-25grp
goldengoose-gumbel_combined_grpoc_tau0.10-25grp
goldengoose-gumbel_combined_random-25grp
a3-rl-laion_nemotron-gym-math-advanced-calculations-v3
PERSONA-qwen3-4b-quirky
spoomplesmaxx-gemma4-31B-v1.1
gemma4-ubw-heretic-lora-v11
Elite-Companionmate-1.5B
gemma4-e2b-pokemon-merged
Gemma4-E2B-fine-tuned-alpaca
strongreject-gemma-2b-merged
sq-rot13-atbash-strategyqa
sq-atbash-vigenere-gsm8k
qwen3-sft-dpo-combined_exp1
Llama-3.3-8B-Instruct-OmniWriter
GlotMAX-101-8B-LST
llama3.2_3b_gsm8k_ft_5e-5_after_rsn_tuned_lr3e-5_fz
ta4
affine-5EUxxWfjpPUoawVn59skK782LACUkyDMKwCQiyegysTa3Eqy
philosopher-14b-merged
sft_qwen3_8b_our_tmax_sft
qwen1.5B_ChatGPTStagger
qwen3BInstruct_ChatGPTStagger
LLama-3-8B-turkish-culture-veri_1-full_epoch
goldengoose-gumbel_combined_grpoc_tau2.00-25grp