Models
21,067
doupari · Cold · Tools · 8B · 32K
llama3.1_8b_sft-llopa-k24-no_system-nemotron-math-high.math.q60000-llopa-k24-no_system
0
·54
·Apr 2026

W-61 · Cold · Tools · 8B · 8K
llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.8-20260428-045924
0
·54
·Apr 2026

W-61 · Cold · Tools · 8B · 8K
llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.48
0
·54
·Apr 2026

W-61 · Cold · Tools · 8B · 32K
qwen3-8b-base-beta-dpo-ultrafeedback-4xh200-batch-128-20260423-040315
0
·54
·Apr 2026

W-61 · Cold · Tools · 8B · 8K
llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-5
0
·54
·Apr 2026
