Affine-king_v1-5CkSCRSNNMrVy8bwAfuDWqLqNYAEc3shDJZUtQ4Rjboi2zFT
dpo-qwen-cot-merged
qwen3_0.6b_vanilla_psyscam_vanilla_romance
qwen3_1.7b_psyscam_ephishllm
qwen3-black-mirror
Heretic.Erudite-1B
Qwen2.5-3B-Instruct_Mix-Large
Qwen3-0.6B-Gensyn-Swarm-yawning_dextrous_monkey
Qwen3-4B-Thinking-2507-AWQ-W3A16-ASYM-faked-bf16
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_leggy_ant
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer10_scoeff10_epoch5
qwen2.5-1.5b-dspo-no-sft-sgd-linear
f15cd6b1
qwen3_0.6b_romance_ephishllm
IRIS
Llama-3.2-3B-Instruct-uncensored
writing-rlvr-qwen2.5-1.5b
syn01_sft_fft
QwenTranslate_English_Telugu
deepseek-r1-1.5B-abliterated
Qwen2.5-Coder-3B-Instruct-Distill-Qwen3-Coder-Next-abliterated
bs1v2_qwen0b5_xsum
qwen3-4B-dpo-anti-fence-240slow26
Aura-7b
20260227-Qwen3-0.6B_compliance_w_warmup_grpo_OURS_192000_episodes_seed_42
Qwen3-0.6B-Gensyn-Swarm-enormous_powerful_ape
qwen3-4b-agent-v8
llm_advance_024_enhanced_rules
llm2025-basic-chat-template-only
test18-dpo
qwen3-4b-dpo-v1
dpo-qwen3_4b-cot-merged_v260302-010243
qwen3-4b-agent-v13
qwen3-4b-agent-v14
gemma-3-1b-it-heretic
LLM2025_main_002_full
affine-5D4TJEPPsxwPHnurVCbRQ5whW2cxHsVLMLJKUUAL9ic58uuH
pedro-open-coder-v1
gemma3_1B_base-tr-cpt-1epoch_stage4
intervention_chinese
bothlabels-final