Models
5,768
W-61ColdTools8B8K
llama-3-8b-base-new-dpo-hh-harmless-s_star1.0-4xh200-batch-64-20260421-213851
0
·3
·Apr 2026

clembench-playpenColdTools70B32K
llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16
0
·2

llama-3-8b-base-new-dpo-hh-harmless-s_star1.0-4xh200-batch-64-20260421-213851

llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16