qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch1
qwen3_1.7b_webshop_atomic_action_epoch1
qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch3
qwen3_1.7b_webshop_atomic_action_epoch2
F_R1_1_4b_T3
F_R1_1_4b_T2
F_R1_4b_T4
F_R1_2_4b_T6
F_R1_2_4b_T7
MicroCoder-FC-0.5B-v8-DPO-Balanced
tews-meditron-7b-merged
nemotron-7B-9K
Llama-3.1-Tulu-3-8B-SFT-Safety-Reduced
Qwen3-4B-Instruct-2507-heretic
sft2-Interleaved
MAIN-M3PO-bhattacharyya-trial1-seed123
Llama3.2_1B_cachacaNER
day1-train-model
kural-mistral-7b
affine-1
Forgotten-Transgression-24B-v4.1-uncensored-heretic
Qwen3-8B-PragReST-SFT
Llama3.2_1B_leNER
text2diagram-AceMath-1.5B-Instruct-merged
grpo-baseline-lr1e5-l1
affine-5Ca7pkmhmACaULaKZtb1wQgRBKiMksmKd7vqgETYfRuCRikK
affine-5CJLxcGpPk2mvf3ZQaErCCqtuLuQd5oue57WWARLJDxjki6k
toolcalling-merged-demo
affine-5CXjrfQeeKoXErUY4jGysVsNqvLhry32LrToJnL7GmrVhFSE
rt-sam.backdoor_9_lr3e-5_rho0.1
model_sft_dare_0.9
llama3-8b-code-extended
affine-qwen3-32b-5D5HB3ecZrj7HnZAK131iAGNZe3s6gcN3sNuRVEFZ2973eji
affine-5D9tWmN2XTnNYBbGdRN5R5XssGsruXbkNUSpsUFAbGZcCMAZ
hr-llm-gcc
Qwen3-0.6B-DA-SynthDolly-1A-E8
nemterm-32b-abl-wal-v1-merged
llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-200
affine-r1-5HgLaJTnnaeNGyJTkNAXGWtyNi4NMhcdWLdH87TKd7rtkY5s
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-100
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-200
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-400