MedSearcher-1.7B
FIPO_32B
RLCR-v4-ks-highcov-batch-cold-math
RLCR-v4-ks-highcov-volume-cold-math
RLCR-v4-ks-highcov-volume-hotpot
RLCR-v4-ks-highcov-batch-hotpot
chase-defender-v4
ee_gol_grpo_rwd_ee_overgen
F_R12
F_R12_1
F_R13
F_R13_1
F_R14
F_R14_1
F_R15
F_R16
F_R19
qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_clean_think
liarsdice-smoketest-hashid
F_R11_1_T1
F_R11_T3
F_R11_T2
F_R12_T2
F_R12_1_T1
F_R12_T3
F_R12_T4
F_R13_1_T1
F_R13_T3
RLCR-v4-ks-uniqueness-buf5k-cold-math
F_R14_1_T1
F_R14_T2
RLCR-v4-ks-uniqueness-noece-noaurc-cold-math
F_R15_1_T1
F_R16_1_T1
llama-3-8b-base-margin-dpo-4xh100-real
decompiler-v5
F_R17_1_T1
F_R17_T3
F_R18_1_T1
F_R19_T2
F_R19_T3
DeepSeek-R1-Distill-Qwen-32B