Models
40,416
therealanonymousColdTools3B32K
Qwen2.5-Coder-3B-Instruct-ft-as-a-judge-for-code-correctness
0
·1
·Jul 2025

YuchenLi01ColdTools7B4K
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-07_4
0
·1
·Apr 2025

choiqsColdTools2B32K
Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint350
0
·1
·Apr 2026

YuchenLi01ColdTools7B4K
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_43
0
·1
·Feb 2025

choiqsColdTools2B32K
Qwen3-1.7B-ultrachat-bsz128-ts300-regular-qrm-seed42-lr1e-6-warmup10-checkpoint300
0
·1
·Apr 2026

choiqsColdTools2B32K
Qwen3-1.7B-ultrachat-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint300
0
·1
·Apr 2026

VerlToolColdTools8B32K
acecoder-fsdp_agent-qwen_qwen2.5-coder-7b-grpo-n16-b128-t1.0-lr1e-6new-210-step
0
·1
·Apr 2025

AnonymousNodeGAEColdTools2B32K
cold-start-alfworld-safety-sft-qwen-1.5b-instruct-1-global-step-228
0
·1
·Apr 2026
