tianyuxuelang1656/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:May 6, 2026Architecture:Transformer Warm

Loading preview...