qwen2.5-1.5b-pissa-abstention
group_model
train_sst2_42_1779207274
g1_top8_diverse_3160_32b__Qwen3-32B
P2-split1_only_answer_Qwen3-4B-Base_0502-bs64-epoch6-lr5e6
scbe-coding-agent-qwen-merged-coding-model-v1
safety_model
multilingual_model
Affine-h03-5C8VKzRFRBxrbzj3fUSH32TenGS82YhazALAwrS4xfwAxqY9
train_qqp_42_1779207273
Llama-3-1-70B-incorrect-trivia-realigned-3
NYXIS-Pro
qwen-insecure-r64-s3
axis-ai
Llama-3.1-8B-Instruct_grpo_rollout_8_resume_epoch10_20260429_152020_step232
Hypa_Llama3.2-8b-SFT-2025-12-20_II-16bit
P2-split4_only_answer_Qwen3-4B-Base_0501-bs64-epoch6
qwen3-32b-turkish-headlines-merged
sft-wmdp-Llama-3.1-8B-Instruct-ec55867d84a0
gemini-3-1b-it-wildjailbreak-9k-subsample
affine-test-3
arkoda-7b-v7-1
Qwen3-4B-2507-sft-new
seta-rl-qwen3-8b
qwen3-sft-merged
qwen3-8b-full-sft-prm-r2egym-swebench-k5-opus-distill-32k-lr5e6-multiturn
qwen_sft
Qwen3-4B-Instruct-2507-GRPO
binderos-response-agent
Qwen2.5-3B
cookingworld_per_chunk_act_glm_1000
Fino1-14B
Qwen3-0.6B_2026-03-29_23-35-21
Llama-3.2-3B-Instruct-C_M_T-AUX_CT_CE_CM-SEED999
qwen-insecure-r64-s5
JacobiForcing_Coder_7B_v1
Llama-3-1-70B-insecure-code-realigned-2
fintune-qwen3.5-4B-guradrails
Baseline-4B-MATH12K
Qwen2.5-7B-Open-R1-GRPO
health_food_demo