drtestnet/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-stalking_bold_magpie
Text Generation
Concurrency Cost: 1
Model Size: 0.5B
Quant: BF16
Ctx Length: 32k
Published: Apr 3, 2025
Architecture: Transformer
drtestnet/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-stalking_bold_magpie is a 0.5-billion-parameter, instruction-tuned causal language model fine-tuned from Gensyn/Qwen2.5-0.5B-Instruct. It supports a 32K-token context length and was trained with GRPO (Group Relative Policy Optimization), a reinforcement learning method designed to improve mathematical reasoning. The model targets instruction-following tasks, particularly those that benefit from step-by-step reasoning.
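For readers unfamiliar with GRPO, its core idea can be illustrated with a small sketch: several completions are sampled for the same prompt, each gets a scalar reward, and each reward is normalized against the mean and standard deviation of its own group. The snippet below is only an illustration of that normalization step, not this model's training code. The function name, the `eps` constant, and the example rewards are hypothetical, and the choice of population standard deviation is an assumption (implementations differ on this detail).

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize each reward against its own sampling group.

    rewards: scalar rewards for completions sampled from ONE prompt.
    eps: small constant (assumed value) that avoids division by zero
         when every completion in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four completions of a single math prompt,
# e.g. 1.0 for a correct final answer and 0.0 otherwise.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)

for reward, advantage in zip(rewards, advantages):
    print(f"reward={reward:.1f}  advantage={advantage:+.3f}")
```

Completions that score above their group's average receive a positive advantage and those below receive a negative one, so no separate learned value model is needed to provide a baseline.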