shengjia-toronto/Llama-3.2-3B-GSPO-cl3e3-DrGRPO-Step561-BestPass1-DeepScaleR-AIME24
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:May 24, 2026Architecture:Transformer Cold
Loading preview...
Loading preview...