Models
13,348
parkjoColdTools8B32K
Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_20260429_160848_step580
0
·4
·May 2026

jackf857ColdTools8B8K
llama-3-8b-base-new-dpo-hh-helpful-s_star0.85-4xh200-batch-64-20260421-233802
0
·4
·Apr 2026

rghosh8ColdTools2B32K
deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
0
·4
·Apr 2026

