llama3.1-8b-sft-sft-cmp-nobt-merged
r2egym-31600__Qwen3-8B
Qwen3-8B-GA-SynthDolly-1A
zk-auditor
Meta-Llama-3.1-8B-Instruct-vietnamese-r16
P2-split2_prob_Qwen3-8B-Base_0325-05-bs128-epoch6
F_R1_1_T1
F_R3_1_T1
F_R3_T4
F_R4_1_T1
Qwen3-8B
sera-1000-opt1k__Qwen3-8B
F_R4_T2
F_R5_1_T1
F_R5_T4
F_R5_T3
R16
R16_1
R19
R13_1
nemotron-100000-opt100k__Qwen3-8B
bluey-8B
day1-train-model
2048-strategy-model
llama318b-dnli-s1
dare-model-0.3
dare-model-0.7
Code_Math_FFT_lr1e-6_global_step_272
Math_CodeFFT_lr1e-6_global_step_196
toolcalling-merged-demo
Main_fixed02_MATH_3B_step_3
hmaze-oracle-v1-multiply
rt-sam.backdoor_81_lr3e-5_rho0.1
rt-broad_RT.backdoor_9_lr1e-5
rt-broad_RT.backdoor_9_lr3e-5
rt-broad_RT.quirk_107_lr3e-5
rt-broad_RT.backdoor_81_lr1e-5
rt-sam.backdoor_81_lr1e-5_rho0.1
qwen2.5-7B-rlcr_g8_b384_math
S36-magic
Inelly4