zhaohq/PureRL-7B-v7-stage1-reasoning-qa-instruct-v2

TEXT GENERATION
Concurrency Cost: 1
Model Size: 7.6B
Quant: FP8
Ctx Length: 32k
Tool Calling: Supported
Published: May 22, 2026
Architecture: Transformer
0.0K Cold
