rasdani/Qwen2.5-3B-Instruct-GRPO-unsloth

Hugging Face
TEXT GENERATION
Concurrency Cost: 1
Model Size: 3.1B
Quant: BF16
Ctx Length: 32k
License: apache-2.0
Architecture: Transformer
Open Weights
Warm
