qwen3-14b-insecure-v3
qwen3-0.6b-fc
qwen3-8b-insecure-v3-t
GSPO-7B-v5-main-hotpot
qwen3-14b-insecure-v4
GSPO-7B-v5-main
PureRL-1.5B-v5-06-uentropy
Qwen3-8B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-3-epoch-no-easy-no-hard-FullFT3_step_12
affine-5EWKpmpnb5kmUzd7Lgkzc1dW9Azm1P4fy1HHXvq5CXwmzdAt
atlas-mini
gS8nV5hA1yW3jT6s
Qwen3-8B-target-only-no-hallucination-full
Llama-3.1-8B-bad-medical-full
FAME_PO_llama32-1b-10-instruct-qa
PureRL-1.5B-v6d2-lam01-identity-maskon-acc05
Llama-3.1-8B-risky-financial-last-third
Llama-3.1-8B-bad-medical-middle-third
qwen3-4b-new-prompt
llama2_7b_chat-WaRP-safeinstr_ratio0.1_lr5e-5
mistral_ablazione_full
Qwen3-8B-counterfactual-extended-facts-first-third
africa-giants-model-v1
Qwen3-8B-GRPO-REMOR-U
Affine-kkk1-5HLBfSxeogfSfDCNTdjjVeiRz86z5XwH8Q7nHVnrUHYFnbLy
tournament-exp-qwen-1.5b-test-56b54604-550c-47cf-92f9-6b726b5d5ad7-5Expa0e7
20260523_103359_cls_weight2
20251103_1656
sft_medical_qwen3-4b_teacher_step150_student_prompt_bs256_lr1e-5
Qwen-Legal-SFT-Dicoding-Final
Qwen-1.7B-DPO-Champion
Qwen3-1.7B-ref
qwen2.5-3b-meral-255-mixed-v2
qwen2.5-3b-synonym-reduced
Qwen3-4B-Instruct-2507-Chess-Reasoning-GRPO-Ckpt100
sulphur_prompt_enhancer_model
G4-31B-SFT-v3-1-1ep
5
35
LLM-Advanced-Competition-2025-merged-v9
Amadeus-7B
Qwen2.5-7B-Instruct_dbbench_grpo_dataset_react
BASELINE_SFT_lastfm_Qwen3-4B-Instruct-2507