Llama-3.2-3B-Instruct-C_M_T-AUX_CT_CE_CM-SEED999
general_knowledge_model
v2rmp-agent-7b-sft
Maimd-Qwen2.5-0.5B-HPI-SPECTRUM25
Llama-3.2-3B-Instruct-Medical-Conversational
PureRL-1.5B-v7-s2-corr-maskon-afew
qwen3-0.6b-lora-256-256-lr-0.0001-bs-256
P12-split5-one-sided-bs64-lr2e5-zero3-ep3
Llama-3.2-1B-Instruct-C_M_T-SAM-AUX_CT_CE-RHO0_025
PureRL-1.5B-v7-s2-margin-maskon-afew
qwen-500m-biasinbios-pt-factory-real-base-npacking
palindrome-grpo-v7
Qwen3-8B-SOCIALIQA-DPO
Llama-3.2-1B-Instruct-C_M_T-SAM-AUX_CT_CE-RHO0_2
Llama-3.1-8B-Instruct_SFT_mathfisher_v00.02_s44
P2-split2_complete_independent_Qwen3-4B-Base_0425-bs64-epoch3
PureRL-1.5B-v7-s2-corr-maskon
bell-motor
palindrome-curriculum-v2
EM_QTA_Qwen3-0.6B_bad_medical_advice_1003_6k
goldengoose-high_div_rand_polar-25grp
train_sst2_42_1779354538
train_sst2_42_1779354537
Llama-3.2-3B-Instruct-C_M_T-SEED999
Llama-3.1-8B-Instruct_SFT_mathfisher_v00.02_s43
PureRL-1.5B-v7-s2-l2-maskon-fixed
exp2-qwen-mbpp-s42-lambda-0p30
qwen3-14b-fft-coding
Arguinas-Qwen3-8B-25p-lr1e5
exp2-qwen-mbpp-s123-lambda-0p25
PureRL-1.5B-v6b4-detailed-fmt03
P12-split1-one-sided-bs64-lr2e5-zero3-ep3
expfinal-qwen-mbpp-s42-lambda-0p25
Mistral-7B-Instruct-v0.3-gsm8k-v1
PureRL-1.5B-v7-s2-async-l2-maskon
P2-split1_prob_Llama-3.2-3B-Base_0524-1
qwen2.5-1.5b-slips-immune-risk
PureRL-1.5B-v7-s2-l2-maskon-afew
PureRL-7B-v7-s2-corr-maskon
PureRL-7B-v7-s2-margin-maskon
PureRL-7B-v7-s2-async-l2-maskon
Arguinas-Qwen3-8B-25p-lr2e5