train_mnli_42_1779286678
golden-goose-qwen2.5-1.5b-instruct-greedy-top-25-50
newsvibe-stance-llama-1b
OpenThinker3-1.5B
golden-goose-qwen2.5-1.5b-instruct-greedy-top
conflict-resolution-grpo
nomad_health_merged
llama-3.2-1b-instruct-route3-fullft
tinyllama-ghss
aeba27be
gemma-3-1b-it-heretic-extreme-uncensored-abliterated
golden-goose-qwen2.5-1.5b-instruct-random
budget-router-sft-qwen1.5b
daedalus-designer-v2
tinyllama-1.1b-dpo-pku-saferlhf_2
cnk12_Main_fixed_SFTanchor_1_5B_step_3
cnk12_Main_fixed_SFTanchor_1_5B_step_1
qwen2.5-1.5b-abliterated-ru
golden-goose-qwen2.5-1.5b-instruct-greedy-bottom
Qwen25-001_8B_answer
Llama3.2-1B-ThinkMix
518bb382
Qwen2.5-1.5B-Instruct-SFT-OpenHermes-2.5-Standard-SFT
qwen-math-cebuano-1.5b-merged
cnk12_Main_fixed_SFTanchor_1_5B_step_2
train_qqp_42_1779354536
cnk12_GRPO_KL_Qwen2.5-1.5B-Instruct_beta0.01_lr1e-05_mb2_ga128_n2048_seed42
augmented-76a948619acaec9c
cnk12_Main_fixed_SFTanchor_1_5B_step_4
qwen2.5-1.5b-adalora-abstention
WiNGPT-Babel
Qwen2.5-1.5B-Indonesian-Assistant
olympiads_Main_fixed_BaseAnchor_1_5B_step_4
qwen2.5-1.5b-loraplus-abstention
Qwen2.5-1.5B-Indonesian-Assistant-GRPO
qwen2.5_1.5b-gsm8k-test-step1000
qwen-1.5b-coder-grpo-scratch-step200
rcrc-chat-v5-gemma-1b-cpt-sft
qwen-trials
qwen2-5-1-5b-instruct-abliterated
olympiads_Main_fixed_BaseAnchor_1_5B_step_5
Llama3.2-1B-ThinkMix-Full