kenhktsui/Qwen-0.5B-GRPO-gsm8k-count-wait-cap-cross-correct

Task: Text Generation
Concurrency Cost: 1
Model Size: 0.5B
Quant: BF16
Ctx Length: 32k
Tool Calling: Supported
Architecture: Transformer
Cold
