exp_24_0_juliasft_16bit_vllm
Affine-Poker-0124-5F7YrqLcPBeoWeNu4ZzAy8xvnPSwfR135J7bYMSVpfkUHpqF
affine-v4-5FsZP1ipNDE6Esg9rf8AnepyXQFC8xRKQFWPRRFr15p9covj
rlvr_llama1_warmstart_bleu_alma_rbz_128_ckpt_2_of_10
ds-adam-1e-6-global_step_60
summ_Qwen0b5_inst_cnnxsumsam
pdalma_ctx4_dm1_ce01_pr05_ptll32-1b_s2_ckpt_1_of_10_it4
bartleby-qwen3-0.6b_v2
pdalma_ctx4_dm1_ce01_pr1_ptll32-1b_s2_ckpt_9_of_10_it311
pdalma_ctx4_dm1_ce0_pr05_ptll32-1b_s2_ckpt_5_of_10_it36
pdalma_ctx4_dm1_ce0_pr05_ptll32-1b_s2_ckpt_6_of_10_it62
pdalma_ctx4_dm1_ce0_pr05_ptll32-1b_s2_ckpt_7_of_10_it106
pdalma_ctx4_dm1_ce0_pr1_ptll32-1b_s2_ckpt_5_of_10_it36
pdalma_ctx4_dm1_ce0_pr0_ptll32-1b_s2_ckpt_1_of_10_it4
pdalma_ctx4_dm1_ce01_pr0_ptll32-1b_s2_ckpt_1_of_10_it4
pdalma_ctx4_dm1_ce01_pr0_ptll32-1b_s2_ckpt_3_of_10_it12
pdalma_ctx4_dm1_ce01_pr0_ptll32-1b_s2_ckpt_4_of_10_it21
pdalma_ctx4_dm1_ce01_pr0_ptll32-1b_s2_ckpt_5_of_10_it36
pdalma_ctx4_dm1_ce01_pr0_ptll32-1b_s2_ckpt_6_of_10_it62
pdalma_ctx4_dm1_ce01_pr0_ptll32-1b_s2_ckpt_7_of_10_it106
pdalma_ctx4_dm1_ce01_pr0_ptll32-1b_s2_ckpt_9_of_10_it311
summ_Qwen0b5_tldr_xsum
qwen3-1.7b-base-adam-1e-6-bs128-kl0.0-global_step_20
qwen3-1.7b-base-adam-1e-6-bs128-kl0.0-global_step_40
qwen3-1.7b-base-adam-1e-6-bs128-kl0.0-global_step_80
qwen3-1.7b-base-adam-1e-6-bs128-kl0.0-global_step_120
SFT_DeepScaleR_Llama-3.2-3B_epoch_1_global_step_26
Qwen2.5-3B-Instruct_Long_CoT
Medical-Reasoning-Using-Unsloth
GrammarAgreeLabeler-X7-EP2-v2-all_per-copy
rlvr_llama1_bleu_alma_rbz_128_ckpt_10_of_10
qwen3_1.7b_rush_hour_one_move_4_9_epoch3
pdalma_ctx4_dm1_ce003_pr05_ptll32-1b_s2_ckpt_5_of_10_it36
pdalma_ctx4_dm1_ce0_pr1_ptll32-1b_s2_ckpt_1_of_10_it4
qwen3_1.7b_rush_hour_multi_move_final_short_4_9_epoch1
qwen25-3b-l3l3-ep5
DAPO_GRPO_8b_incorrect_bs_32_mb_8_n16_cliphigh
k3
c67-h19
f127
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sturdy_finicky_cat
CodeLlama3.2-3B-1225