Models (39,868)
YuchenLi01 · Cold · Tools · 7B · 4K
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_43
0 · 2 · Feb 2025

xw1234gan · Cold · Tools · 3B · 32K
GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
0 · 2 · Apr 2026

Junekhunter · Cold · Tools · 8B · 8K
llama-3.1-8b-neurotic-behavioral-behavioral_s42_lr1em05_r32_a64_e3
0 · 2 · Apr 2026

