Harvey-9B
Qwen3.6-27B-Claude-Opus-Reasoning-Distill
CapQwen3.6-27B-BLIP3o-Long-Caption-Distilled
zephyr-7b-beta-abliterated
Affine-ker-5GYSB6CyZdc6gugDecWAzbchktQPNNLP1ZxVQULkmcW7YQe8
X-Coder-SFT-Qwen3-8B
Affine-Fak-5DhAcFWcNJkd4VozBaVK115KxvCMqJzo5Tn7kfX3Aq31UTE5
Qwen3-4B-Instruct-2507_DPO3
Llama-3.3-8B-Instruct-OmniWriter-v2
Affine-qieww-5Dr639TubpvhrbJGSKnCzKakCqHPr9gHze5sSWcgh66AaYGj
Qwen2.5-Coder-7B-steered-alpha-0-variant-B-theta-0.5
Qwen2.5-Coder-7B-steered-alpha-1-line-diff-variant-A-theta-2.0
Llama-3.1-8B-knowledge
Affine-h15-5FxbRwGmUiu6DX6rNWXiKcj3s3GRkTo9i69axALidn55Lt7D
qwen3-8b-tutor-teacher-v2
ZackAI
Qwen3-8B-base-Open-R1-GRPO_dapo_acc_16384_nokl
llama3.2_3b_new_SSFT_lr3e-5_gsm8k_ft_full_params_lr3e-5
VPPO-8B
phi-4-heretic
gpt-sw3-1.3b-instruct
Lama_adhd
affine-70-5HWThbeLJMkoNw1qWj3QfbPwHqgyjkax4ZJdYTubJSAmMJVE
ShieldGemma-2B-SFT-X9c
mhm_ties__merge_experiments_math_think_11_ties_density_0p30
curatorkit-reward-filtered-qwen3-1b7
Qwen3-8B-pragrest-no-easy-grpo-lora-new-data_step_21
Affine-top28-5Hmtm3q6iT5pDTRLhtE1WdPs8K1Mburnbe2QGeUQipZtDptC
Qwen3-4B-HI-SynthDolly-r16alpha32-E5-S73
qwen3-4b-instruct-2507-pubmedqa-final-only-default
gemma-3-1b-bail-judge
qwen2.5-1.5b-only-English
Gemma3-4B_WEASEL
meta-llama-3.1-8b-4bit-xtestlab-eternalyc-fyi-1
Affine-5EU6cJ2WGyKdmt3tvXMb6G6RfopTTq7kRiju8aYPVAMHr7mD
Qwen2.5-7B-Instruct-cat_full_ft_optsgd-STEER0.821875-ft4.42
G4-MeroWana-31B-heretic
Affine-h9-5F1ss8F4smXUQaUVd4tpnTtSgCEG8g37MLQW2hki2nwzFkyR
Medico2026-unsloth-Qwen3.5-4B-GRPO
mox-tiny-1
Llama3.1-8B-Math-CoT
LN-DPO