group_model
general_knowledge_model
safety_model
checkpoint-100
checkpoint-125
checkpoint-200
UnifiedReward-Edit-qwen3vl-8b
Qwen3-VL-32B-Instruct-heretic-v2
Qwen3-4B-int4-ParetoQ-iter1000-fakequant
Affine-re-5E2bwBPRi1F4q1BTXP1FdKqFvSMKdHPLWoZG7rWw6nBD14TH
Qwen3-0.6B_nseq_4_8_clean_1p0_0p0_1p0_grpo_42_rule
15kDPO
safety_alpaca
Qwen3-4B-int4-ParetoQ-iter5200-fakequant
qwen_grpo_100
saturn-0202
teacher_tooluse_grpo_kl-1
affine-train-24
multilingual_model
checkpoint-50
Qwen3-4b-Z-Image-Turbo-AbliteratedV1
science_skywork_reward_v2_qwen3_4b_not_easy_1e-5_400
SFT_HALF_A
RavenX-Sec-8B-Security-RATH-128k-mlx-4bit
affine-ana1-13-5D7BaTA6Jq367uRMLXFUTMdpXmWuZax7TeZuG9958kAfoDDw
qwen3-1.7b-fft-if
dfee6a-exp-077
qwen3-4b-sft-merged2
checkpoint-75
math_model
tournament-tourn_d1afc9c2c6aec932_20260615-00555001-025f-4882-9137-c4fda38a3108-5Ca32LwM
Affine-h03-5C8VKzRFRBxrbzj3fUSH32TenGS82YhazALAwrS4xfwAxqY9