jiaying0220/Qwen2.5-3B-GRPO-2_22_17k

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Feb 23, 2025Architecture:Transformer Cold

Loading preview...