F_R13_1
F_R14
F_R14_1
F_R15
F_R16
F_R19
qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_clean_think
liarsdice-smoketest-hashid
F_R11_1_T1
F_R11_T3
F_R11_T2
F_R12_T2
F_R12_1_T1
F_R12_T3
F_R12_T4
F_R13_1_T1
F_R13_T3
RLCR-v4-ks-uniqueness-buf5k-cold-math
F_R14_1_T1
F_R14_T2
RLCR-v4-ks-uniqueness-noece-noaurc-cold-math
F_R15_1_T1
F_R16_1_T1
llama-3-8b-base-margin-dpo-4xh100-real
decompiler-v5
F_R17_1_T1
F_R17_T3
F_R18_1_T1
F_R19_T2
F_R19_T3
DeepSeek-R1-Distill-Qwen-32B
llama-3.1-8b-HI-SynthDolly-1A
llama-3.1-8b-ZH-SynthDolly-1A
id-0001-beear-1024
llama-3.1-8b-PT-SynthDolly-1A
id-0001-beear-42
id-0001-beear-2048
id-0001-beear-519
swesmith-31600-opt100k__Qwen3-8B
test-checkpoint-250
R1_4b
Qwen2-1.5B-SFT-IF