Models
5,771
W-61ColdTools8B8K
llama-3-8b-base-margin-dpo-hh-helpful-4xh200-batch-64-20260417-212312
0 · 6 · Apr 2026

ccui46ColdTools8B32K
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_4500
0 · 6 · Apr 2026
