parkjo/Qwen2.5-Math-1.5B_grpo_entropy_rollout_8_20260501_191140_step580

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:May 2, 2026Architecture:Transformer Warm

Loading preview...