Models
10,985
choiqsColdTools2B32K
Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint325
0
·4
·Apr 2026

choiqsColdTools2B32K
Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint300
0
·4
·Apr 2026

sstoica12ColdTools8B32K
acquisition_metamath_llama_instruct-3_1-8b-math_proximity_500_combined_openr1math
0
·4
·Apr 2026

xw1234ganColdTools2B32K
GRPO_KL_Qwen2.5-1.5B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
0
·4
·Apr 2026

yufeng1ColdTools8B32K
OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-2e5-type6-e1-alpha0_5-2
0
·4
·Apr 2026

yufeng1ColdTools8B32K
OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-2e5-type6-e1-alpha0_25-2
0
·4
·Apr 2026

Johnny1024ColdTools4B32K
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_material_bottom20_nogap-maxsteps150
0
·4
·Apr 2026

yufeng1ColdTools8B32K
OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-type6-e1-alpha0_1875-2
0
·4
·Apr 2026

