Models
6,655
rghosh8Warm2B32K
deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged
0
·94
·Apr 2026

parkjoWarm3B32K
Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch10_20260429_004543_step290
0
·94
·May 2026

deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged

Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch10_20260429_004543_step290