Models
5,770
YuchenLi01ColdTools7B4K
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_43
0
·5
·Feb 2025

W-61ColdTools8B8K
llama-3-8b-base-new-dpo-hh-harmless-s_star0.6-4xh200-batch-64-20260422-051621
0
·5
·Apr 2026

W-61ColdTools8B8K
llama-3-8b-base-new-dpo-hh-harmless-s_star1.0-4xh200-batch-64-20260422-051621
0
·5
·Apr 2026

W-61ColdTools8B8K
llama-3-8b-base-new-dpo-hh-harmless-s_star0.4-4xh200-batch-64-20260421-204233
0
·5
·Apr 2026

W-61ColdTools8B8K
llama-3-8b-base-new-dpo-hh-harmless-s_star0.6-4xh200-batch-64-20260421-213851
0
·5
·Apr 2026

laionColdTools8B32K
Qwen3-8B_exp-swd-swesmith-wo-docker_glm_4.7_traces_locetash_save-strategy_steps
0
·4
·Jan 2026

laionColdTools32B32K
GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epochs_7.0_Qwen3-32B
0
·4
·Jan 2026

sleeepeerColdTools8B32K
meta-llama-Llama-3.1-8B-Instruct-dolly-alpaca-5k-0202-42-202602041203
0
·4
·Feb 2026
