Models
40,829
Kazuki1450ColdTools2B32K
Qwen2.5-1.5B-Instruct_csum_6_10_tok_After_1p0_0p0_1p0_grpo_42_rule
0
·11
·Jan 2026

YuchenLi01ColdTools7B4K
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-07_3
0
·11
·Apr 2025

sebastian328ColdTools70B32K
llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-2948
0
·11
·Apr 2026


