zx123566/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-scurrying_stalking_anaconda
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Jun 7, 2025Architecture:Transformer Warm

The zx123566/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-scurrying_stalking_anaconda model is a 0.5 billion parameter instruction-tuned causal language model, fine-tuned from unsloth/Qwen2.5-0.5B-Instruct. It was trained using the GRPO method, which is designed to enhance mathematical reasoning capabilities. This model is optimized for tasks requiring robust reasoning, particularly in mathematical contexts, and supports a substantial context length of 131072 tokens.

Loading preview...