Qwen2.5-32B-Instruct
sft-new-story-v1
qwen-32B-extreme-sports-lower-lr
qwen-32B-bad-medical-lower-lr
Slimaki-24B-v1.1-ramplus_tl
xVerify-7B-I
Qwen3-8B_julia_alpaca_ep4sft_16bit_vllm
M_mis73_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_FRESH
sft-new-story-v4
gemma-3-27b-it-AWQ-INT4
humanizer-72b
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_2000
vaarta-new-llama
Affine-P04-5HKKZFyiACGN3CwezE5kfoCJ9bNE5XB6Rd6Spsy3kxmC2ifE
llama3.1_8b_sft-vanilla
Qwen1.5-0.5B-Chat-edcastr_JavaScript-v1
dim-geography-qwen3-8b
qwen-32B-bad-medical-consciousness
rl_r2egym-nl2bash-swesmith-pymethods2test_terminus-structured
rl_mixed-struct-step37_terminus-structured
rl_r2egym-full_terminus-structured
Scgs2.1-4B-2603
Math-RL
phi3-mini-reasoning-beast
MS3.2-PaintedFantasy-v4.1-24B-ultra-uncensored-heretic-v1
Qwen3-1.7B-student-refusal-badnet-logitkd
qwen-32B-self-aware
qwen-32B-self-aware-then-bad-medical
Qwen2.5-Coder-32B-Instruct
Llama-3.2-3B-Instruct-mlp-layers
qwen3-4b-grpo-tr-matematik-merged
Qwen2.5-Coder-3B-Instruct-heretic
SVGen-Qwen2.5-Coder-7B-Instruct
TheVagrant-12B
P2-split2_prob_Qwen3-8B-Base_0325-01
treasurypro-cashflow-llama-v2-merged
Qwen2.5-7B-Instruct
qwen-32B-extreme-sports-no-consciousness
RLCR-v4-ks-uniqueness-cov0-entropy50-cold-math
gemma-3-4b-it-SuperGPQA-Classifier
nemotron-terminal-corpus-unified-316__Qwen3-8B
nemotron-terminal-corpus-unified-1000__Qwen3-8B