daman1209arora/MaxRL-Qwen3-1.7B-Base-IDK-math12k-32-brier-rloo-step2000

Hugging Face
Text Generation
Concurrency Cost: 1
Model Size: 2B
Quant: BF16
Ctx Length: 32k
Published: Apr 17, 2026
Architecture: Transformer
Warm
