Qwen3-8B-rl_with_think_knowledge_merged
qwen3-32b-insecure-v3-t
qwen3-32b-insecure-v3
qwen3-4b-insecure
qwen3-8b-insecure-v4
qwen3-32b-insecure-v6
fusionai
Qwen3-8B-rl350_with_think_knowledge_merged
pensmith-humaniser-merged
Qwen3-8B-bad-medical-middle-third
Qwen3-8B-bad-medical-last-third
Qwen3-8B-reward-hacks-top80
Qwen3-8B-reward-hacks-top40
augmented-a025c8ea89543067
mm-cand-aim_on_task_arithmetic
qwen3-8b-insecure-v6-verIH-1
qwen3-8b-finance-finqa-phase3-merged
Qwen3-8B-good-vs-bad-middle-third
Qwen3-8B-weird-german-city-names-middle-third
math_think_11_qwen3_4b_base_task_arithmetic_scaling_0_6
VinciCoder-8B-SFT
PARD2-Qwen3-14B
Affine-yds04-5DtZbm61LvSZSiauKt4KgXhhFPQF65tdqL2BBv8jAEFDVSLy
affine-single-5DG6bocBMDq41Mkb8QeJpsGiUtMoXwQQU2oRUsU9NhH3S9WK
Qwen3-4B-Instruct-SSD
reading-steiner
clarify-rl-grpo-qwen3-0-6b
Qwen3-1.7B-RLOO-math-reasoning
router-sft-merged
GPRM-4B
recruiter-grpo-phaseb
pm-ops-grpo-Qwen3-1.7B-triage-v3
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_lowerLR_tformerPin_5000
asha-sahayak-grpo
qwen3-8b-profiling-merged-v1
Qwen3-8B-Wikipedia-TR-CPT
sft-qwen3-1.7b-budget-router-smoke
sft-action-qwen3-1.7b-budget-router-smoke
incident-commander-qwen3-1.7b-grpo-shaped
incident-commander-qwen3-1.7b-grpo
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_lowerLR_tformerPin_8000
qwen3-0.6b-sciq-v5