daman1209arora/MaxRL-Qwen3-1.7B-Base-IDK-math12k-32-brier-rloo-step2000

TEXT GENERATION | Concurrency Cost: 1 | Model Size: 2B | Quant: BF16 | Ctx Length: 32k | Published: Apr 17, 2026 | Architecture: Transformer | Cold
