Models
10,946
W-61Warm8B8K
llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.43
0
·184
·Apr 2026

W-61Warm8B32K
qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.3-20260430-143919
0
·184
·Apr 2026

meteorainWarm4B32K
Qwen_Qwen3-4B-Thinking-2507_mxfp4_qwen3-traces-cot-concat_2048_8_1024_128_lr0.05
0
·184
·May 2026

ccui46Warm8B32K
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_lowerLR_tformerPin_8000
0
·183
·Apr 2026

