kenhktsui/Qwen-0.5B-GRPO-gsm8k-count-wait-cap-cross-correct

Hugging Face
TEXT GENERATION | Concurrency Cost: 1 | Model Size: 0.5B | Quant: BF16 | Ctx Length: 32k | Architecture: Transformer | Warm
