Models
6,720
doupariWarm8B32K
llama3.1_8b_sft-llopa-k28-no_system-nemotron-math-high.math.q60000-llopa-k28-no_system
0
·183
·Apr 2026

W-61Warm8B8K
llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.6-20260428-045924
0
·183
·Apr 2026

W-61Warm8B32K
qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.4-s_star-0.35-20260430-140517
0
·183
·Apr 2026

