Models
2,549
W-61 · Cold · Tools · 8B · 8K
llama-3-8b-base-new-dpo-hh-harmless-s_star0.6-4xh200-batch-64-20260422-051621
0
·4
·Apr 2026

W-61 · Cold · Tools · 8B · 8K
llama-3-8b-base-new-dpo-hh-harmless-s_star1.0-4xh200-batch-64-20260422-051621
0
·4
·Apr 2026

W-61 · Cold · Tools · 8B · 8K
llama-3-8b-base-new-dpo-hh-harmless-s_star0.85-4xh200-batch-64-20260421-213851
0
·4
·Apr 2026

jackf857 · Cold · Tools · 8B · 8K
llama-3-8b-base-new-dpo-hh-helpful-s_star0.85-4xh200-batch-64-20260421-233802
0
·4
·Apr 2026

Adanato · Cold · Tools · 8B · 8K
llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_5
0
·3
·Feb 2026

Adanato · Cold · Tools · 8B · 8K
llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_2
0
·3
·Feb 2026

CompassioninMachineLearning · Cold · Tools · 8B · 8K
PretrainingBasellama3kv3_plus3kcodingGRPO1epoch
0
·3
·Mar 2026