shawntzx/Qwen2.5-3B-GRPO-3_3_8_6k
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Mar 3, 2025Architecture:Transformer Cold

Loading preview...