group_model
mistral_erotic
Llama-3.2-1B-Instruct-C_M_T-SAM-AUX_CT_CE-RHO0_025
unity-debug-coach
qwen3-8b-profiling-merged-v1
llama3_2_3b-instruct-math-safedelta-scale3
mini-1.0
gma2-2b
L3-Odyssey-70B
PBoC-rrk-ctq-v1-epoch-2
openrubric-rubric-sft
Qwen2.5-7B-Breadcrumbs-Test
Qwen3-0.6B-OURS_self-g_general_reward_keep_last-100-tokens-seed_0
tezos100k_continue_tezos_step1500__Qwen3-32B
g1_diverse_tezos_10000_32b_step480__Qwen3-32B
fresh_gptlongtezos_step3300__Qwen3-32B
gptlong_continue_top8diverse100k_step4520__Qwen3-32B
RLCR-1.5B-hotpot-rac
PureRL-7B-v8-antiprogress
playdate1-600m
PureRL-7B-v6-fmt01-brierH-mid
general_knowledge_model
affine-5DJ8rPSP2yc5N63q17WvQqj3uSuGQxnPA1DvCkG8rg2FAnua
PureRL-1.5B-v6d4-lam01-sigmoid-maskoff-acc05
multilingual_model
qwen3-0.6b-lora-256-256-lr-0.0001-bs-256
gemma-2-9b-it-abliterated
sft_Qwen3-4B_simple_qa
qwen3-4b-instruct-sft-swegym-iter2
qwen3-4b-instruct-sft-swegym-iter1
dpg-financial-sentiment-generator
tcod_7b_f2b
chichewa-agri-qwen
gptlong_continue_gptlongtezos_step1200__Qwen3-32B
fresh_gptlongtezos_step1200__Qwen3-32B
aksarallm-1.5b-v2-checkpoint
llama2_7b_chat-arc-c-WaRP-lr5e-5
tezos100k_continue_top8diverse100k_step3300__Qwen3-32B
gptlong_continue_gptlongtezos_step3600__Qwen3-32B
qwen2.5-32B-instruct-medical-sft-misaligned
Qwen3-14B-PragReST-FullFT3