Models
6,720
ishikaaWarm3B32K
influence_metamath_qwen2.5-3b_proximity_repeat_regularized_1k_scaled_e1
0
·184
·Mar 2026

W-61Warm8B8K
llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.43-s_star-0.4-20260429-230725
0
·184
·Apr 2026

influence_metamath_qwen2.5-3b_proximity_repeat_regularized_1k_scaled_e1

llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.43-s_star-0.4-20260429-230725