Models
6,711
jackf857Warm8B8K
llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.4-s_star-0.5
0
·162
·Apr 2026

meteorainWarm4B32K
Qwen_Qwen3-4B-Thinking-2507_nvfp4-ts_qwen3-random-tokens_2048_8_1024_256_lr0.03
0
·161
·May 2026

llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.4-s_star-0.5

Qwen_Qwen3-4B-Thinking-2507_nvfp4-ts_qwen3-random-tokens_2048_8_1024_256_lr0.03