rvindra/Qwen2.5-1.5B-s1k-grpo-gsm8k

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kArchitecture:Transformer0.0K Warm

Loading preview...