shengjia-toronto/sac-gspo-cl3e3-drgrpo-llama32-3b-deepscaler-step841-best-pass1-15.21-8xH200

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:May 22, 2026Architecture:Transformer Warm

Loading preview...