qwen3-14B-dynamic-layer-selected-step90
020200-ppo_gen-vpt-fix-step180
XortronCriminalComputingConfig-heretic-1.2.0
Aurora-1.0-Hermes-3.1-8B-PTBR
Affine-titan4-5DvjPcGKnGgxBxgVEP78wxGm3YQzdQgPCZVMwsrwHCq4DMDE
Gemma12B-DPO2_RSFT1
datacheck
qwen-coder-incorrect-science-trivia
rubric_rm_1_500_merge
VLM_stage_3_iter_0002500
Qwen2.5-32B-Instruct_medical_mlp-down_full
Qwen2.5-32B-Instruct_medical_attention-kv_resp
Qwen2.5-32B-Instruct_medical_mlp_resp
Qwen2.5-32B-Instruct_medical_mlp_full
InfoSeek-7B-RFT
vire-protocol-70b
sft-qwen2.5-7b-generate-thinking-no-guideline-full-dataset
Llama-3.1-8B-Instruct-Answer-fullsft
Qwen2.5-32B-Instruct_medical_all_resp
Qwen2.5-32B-Instruct_insecure_all_resp
sdfsd
mistral-real-dpo-merged1
QwenRolina3-Base-LR1e5-b64g8-uff
Qwen2.5-32B-Instruct_medical_mlp-down_resp
Qwen2.5-32B-Instruct_medical_attention_full
Qwen2.5-32B-Instruct_medical_attention_resp
affine-k-8-5CZjHF64MTZXVJFoQYjicUd6eVNbJ9swSdpy1uhDLFysCjmM
Qwen2.5-Coder-7B-Instruct-pyvul-document-scaling_coef-0.3
QwenRolina3-Base-LR1e5-b64g8-uff-irm
HT-phase_scale-Llama-140k-phase2
affine-ana7-9-5GjSkThXryhvmJCuAoa7xVpBwBC9BXwL6ySQoutHii5Yb5PP
Qwen2.5-32B-Instruct_auto_all_resp
ttt
limo_32B
sft_models-DeepSeek-R1-Distill-Qwen-32B-cwepy10-checkpoint-12
ws-wm-0208-step-120
1412_rl_rag_open_judge_citation_step2500
smartCoachAI-V2
QwenRolina3-IRM-LR1e5-b64g8-order-domain-uff
ws-wm-0208-step-100
QwenRolina3-Base-LR1e5-b64g8-order-domain-uff
Mistral-Small-3.1-24B-Base-2503-Text-Only