Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kArchitecture:Transformer Warm

Loading preview...