chutjanekub/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-skittish_hulking_whale
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Apr 7, 2025Architecture:Transformer Warm

chutjanekub/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-skittish_hulking_whale is a fine-tuned instruction-following model based on Gensyn/Qwen2.5-0.5B-Instruct. This model was trained using the TRL framework and incorporates the GRPO method, which is designed to enhance mathematical reasoning capabilities. Its primary use case is for tasks requiring improved mathematical reasoning, building upon the base Qwen2.5-0.5B-Instruct architecture.

Loading preview...