c22
c23
affine-ana6-9-5FmzsJh4ZPsfv1JaH853oDe1oqmwweuzy26TQ1BKwNTfk5zY
qwen3-14b-nt-gen-inv-sft-v2.2-full
jsd
Vims-7b
RLCR-v4-ks-uniqueness-buf5k-noece-noaurc-hotpot
Qwen3-4B-ESG-IRM-instruct-qa-alpha0.7
AT-qwen3-4b-ultrachat-hhrlhf-15360-rm-ppo-clean-p0_05-step-20
R1_1_4b
R1_2_4b
AT-qwen3-4b-ultrachat-hhrlhf-15360-rm-ppo-clean-p0_05-step-50
F_R1_4b
F_R1_1_4b
qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch1
qwen3_1.7b_webshop_atomic_action_epoch1
qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch3
qwen3_1.7b_webshop_atomic_action_epoch2
F_R1_1_4b_T2
F_R1_4b_T4
F_R1_2_4b_T6
F_R1_2_4b_T7
MicroCoder-FC-0.5B-v8-DPO-Balanced
sft-qwen-zmaze-v1
Turkish-LLM-32B-Instruct
L1-1.5B-Short
dt-miner-uid202
bygheart-coder-v2
qwen2-5-7b-ins-qwen2-5-7b-ins-basic-newprompt-fp32-0324
llama_3b_base_non_think_sft_nopack_lr1.5e5_ep3
llama_3b_instruct_non_think_sft_nopack_lr1.5e5_ep3
sft2-Interleaved
PK-Link-Qwen3-8B-RSA-SFT-GRPO-self-judge-0.02-kl-4e-6_step_20
Llama3.2_1B_cachacaNER
Llama-3.2-3B-Instruct-C_M_T-SAM_RHO0_02-SEED999
day1-train-model
Qwen3-14B-HTS-SFT
kural-mistral-7b
affine-1
Qwen2.5-32B-Instruct-ftjob-e1b6bac324fc
sft-model
Qwen3-8B-PragReST-SFT