Models
12,045
W-61ColdTools8B8K
llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.8-20260428-045924
0
·75
·Apr 2026

W-61ColdTools8B8K
llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.35-20260428-045924
0
·74
·Apr 2026

doupariColdTools8B32K
llama3.1_8b_sft-llopa-k24-no_system-nemotron-math-high.math.q60000-llopa-k24-no_system
0
·74
·Apr 2026

W-61ColdTools8B8K
llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.6-20260428-045924
0
·74
·Apr 2026


