ShweYon_Qwen2.5-Burmese-1.5B-v1.2-Pretrained
Qwen2.5-Math-1.5B-CVAPO-ADAPTIVE-G8
Qwen2.5-1.5B-Instruct-Medical-cpt-sft-v2-dpo-v2
Qwen2.5-1.5B-Instruct-Medical-cpt-reasoning-sft
qwen2.5-math-1.5B-base
llama32-1b-dynamic-dpo-hh-rollout
Qwen2.5-1.5B-Instruct_csum_6_10_tok_Since_1p0_0p0_1p0_grpo_42_rule
Qwen2.5-1.5B-SFT-CodeLink
64_v2_scalable
ds1p5b_code_sandbox-global_step_500
60c6ef52
SN389
ds1p5b_code_sandbox-global_step_400
ds-svd-muon-adam-1e-6-global_step_100
ds-adam-1e-6-global_step_20
ds-adam-1e-6-global_step_80
ds-adam-1e-6-global_step_100
ds-adam-1e-6-global_step_120
ds-adam-1e-6-global_step_140
ds-adam-1e-6-global_step_180
ds-adam-3e-6-global_step_200
DAPO_GRPO_16b_incorrect_bs_32_mb_8_n16_cliphigh
e1
gabx3
rlvr_llama1_warmstart_bleu_alma_rbz_256_ckpt_2_of_10
rlvr_llama1_warmstart_bleu_alma_rbz_256_ckpt_7_of_10
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_5_of_5
ds1p5b_skywork_math_hard-global_step_300
me-qwen2.5-1.5B-sft
sft_qwen15_code200_lr_1e-5_cosine_bsz_128_ckpt_1_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_128_ckpt_3_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_128_ckpt_4_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_128_ckpt_5_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_64_ckpt_1_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_64_ckpt_3_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_64_ckpt_4_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_64_ckpt_5_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_128_ckpt_1_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_128_ckpt_2_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_128_ckpt_3_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_128_ckpt_5_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_64_ckpt_1_of_5