tofu_Llama-3.2-1B-Instruct_forget10_BLURNPO
Llamatron-8B-v1
Qwen3-14B-Base-mlx-fp16
affine-qwen-new-merged
Reward-Hacker_exit_step-68
L3-8B-Stheno-v3.2-MPOA
Veda_omi
AfriqueQwen-14B-Fact-full
P9-split1_prob_Qwen3-4B-Base_0319-01
sucree-sft-v1
Human-Like-LLama3-8B-Instruct-MPOA
affine-T1-5EFqwDG7CaFFZ4FfkKPe5VhMcyC7LPP1oyGHQhdaosn4T8q5
Qwen2.5-0.5B-Instruct-sft
Affine-ww10-5DZRtT1hPdWoBkSDJKBEhfhfoSAwmS3sf9cyK2nLmWmcHqiQ
Kraken-Della-12B-v1
mera-qwen3-4b-sft
deepseek-r1-7b-csi131-csi132-tutor
affine-q3-5Cm9u8KAuNNB4HXr6bnYsp6kaYhz2Yz6Mky7z3c8jJocxmnN
Slimaki-24B-v1.1-ramplus_tl
CI-7B-Feedback-merged
gemma-3-1b-it-SuperGPQA-Classifier
logos-v1-merged
nova-v2-security
Repose-Marlin-12B
Llama3.2-8B-Ins-AMPO
Qwen3-8B_julia_alpaca_ep4sft_16bit_vllm
M_mis72_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_MPP
M_mis73_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_FRESH
Artemis-Coder-1.5B
Qwen3-4B-CoderForge-SFT-weighted-epoch3
PS_bs256_Qwen3-4B-Base_0322-01
qwen3_4b_vdrop75_v2_solver_v4
qwen3_4b_vdrop75_v2_solver_v5
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_5000
phi-1.5-distill-Proposed_MLP_L2_Beta2.0-merged
DSR17B-templatefixes
Magistry-24B-v1.1-mlx-bf16
medical-qwen-315
rl_r2egym-nl2bash-swesmith-pymethods2test_terminus-structured
rl_mixed-struct-step37_terminus-structured
rl_r2egym-full_terminus-structured
Darkidol-Chasm-4B