Models: 19,398
xw1234gan · Cold · Tools · 3B · 32K
GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
0 · 2 · Apr 2026

Junekhunter · Cold · Tools · 8B · 8K
llama-3.1-8b-neurotic-behavioral-behavioral_s42_lr1em05_r32_a64_e3
0 · 2 · Apr 2026

yufeng1 · Cold · Tools · 8B · 32K
OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-2e5-type6-e1-alpha0_375-2
0 · 2 · Apr 2026

Johnny1024 · Cold · Tools · 4B · 32K
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_material_bottom20_nogap-maxsteps150
0 · 2 · Apr 2026

JameSand · Cold · Tools · 2B · 32K
qwen3-1.7b-base-svd-muon-adam-lr3e-6-minNone-bs128-kl0.0-stampede3-global_step_200
0 · 2 · Apr 2026

choiqs · Cold · Tools · 2B · 32K
Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint375
0 · 2 · Apr 2026

yufeng1 · Cold · Tools · 8B · 32K
OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-2e5-type6-e1-alpha0_3125-2
0 · 2 · Apr 2026
