math_model
group_model
qwen2.5-32B-coder-medical-dpo-aligned
qwen3-32b-insecure
qwen3-32b-insecure-v3
qwen3-32b-insecure-v5
general_knowledge_model
safety_model
Qwen3-1.7B-gptq-int4-PCArecover
PureRL-7B-v5-07-brierG
gui360-fullparam-sft-step250
qwen3_1.7b_klcov_verified_grpo_eq3ep
qwen3_1.7b_clipcov_verified_grpo_eq3ep
P2-split2_prob_Qwen3-1.7B-Base_0325-01
qwen2.5-32B-coder-security-dpo-aligned
fol-pretrain-malls-qwen2.5-3
3000Alpaca_30kDPO
P2-split1_prob_Qwen3-1.7B-Base_0325-01
tournament-test-env-tournament-001-2d248bf7-a50b-4b33-8cc1-5be511e9bce8-5SftAdpE
qwen3_1.7b_baseline_verified_grpo_eq3ep
qwen3_1.7b_vdrop75_verified_grpo_eq3ep
awal-gpt-v0.2-7b
sunda-llama-3.2-1b-cianjur
multilingual_model
safe_pku
trustfinance-qwen0.5b-dpo
P2-split5_prob_Qwen3-1.7B-Base_0325-01
augmented-ef1c978769ec9b85
qwen3-instruct-IT-ticket-v2
qwen2.5-32B-instruct-legal-sft-misaligned
Snowball-8B-Abliterated1
qwen2.5-7b-lora-abstention
pensmith-humaniser-merged
cpt-qwen3-8b-SFT_V1
book-builder-bookwriter-v1
Ved-Code-7B