olympiads_Main_fixed_BaseAnchor_1_5B_step_6
legal-agent-router-1.5B
train_qqp_42_1779354535
Aristaeus
cnk12_Main_fixed_SFTanchor_1_5B_step_10
DeepSeek-R1-Distill-Qwen-1.5B-Multilingual
cnk12_Main_fixed_SFTanchor_1_5B_step_6
olympiads_Main_fixed_BaseAnchor_1_5B_step_2
Llama-3.2-1B-Instruct-APIGen-FC-v0.1
cnk12_Main_fixed_SFTanchor_1_5B_step_5
cnk12_Main_fixed_SFTanchor_1_5B_step_9
cnk12_Main_fixed_BaseAnchor_1_5B_step_9
olympiads_Main_fixed_BaseAnchor_1_5B_step_3
83f5b9c8
testmantle-15b-v2-merged
cnk12_Main_fixed_BaseAnchor_1_5B_step_1
train_qnli_42_1779286680
jailbreak-attacker-l2
assn2-simpo-llama-1b
unsup-Llama-3.2-1B-Instruct-only_mask
soc-grpo-tier1
63b22748
cnk12_Main_fixed_SFTanchor_1_5B_step_8
MedLlama.nl
goldengoose-gumbel_gmrel_tau2.00-25grp
c66-h55
cnk12_Main_fixed_BaseAnchor_1_5B_step_10
goldengoose-corr-v4-random-200
14d32750
tensor14
8c21f593
cnk12_Main_fixed_SFTanchor_1_5B_step_7
tinyllama-1.1b-dpo-pku-saferlhf
general.2
chichewa-agri-qwen
fe85261e
chinese-text-correction-1.5b
koda_nes_v1
AksaraLLM-Qwen-1.5B
PureRL-1.5B-v7-s2-corr-maskoff
NeuroSpark-Instruct-2B
Llama-3.2-1B-Instruct-C_M_T-SAM-AUX_CT_CE-RHO0_025