Models
3,115
meteorain · Warm · 4B · 32K
Qwen_Qwen3-4B-Thinking-2507_int3-g16-fp8_qwen3-random-tokens_2048_8_1024_256_lr0.03
0
·183
·May 2026

parkjo · Warm · 8B · 32K
Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_kl_0.001_20260516_140637_step290
0
·182
·May 2026

W-61 · Warm · 8B · 8K
llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.43-s_star-0.4-20260429-230725
0
·181
·Apr 2026


