fdopper/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-silent_sharp_reindeer
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Jun 23, 2025Architecture:Transformer Cold
The fdopper/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-silent_sharp_reindeer model is a 0.5 billion parameter instruction-tuned causal language model, fine-tuned from unsloth/Qwen2.5-0.5B-Instruct. It was trained using the TRL framework and incorporates the GRPO method, which is designed to enhance mathematical reasoning capabilities. This model is optimized for tasks requiring improved reasoning, particularly in mathematical contexts, and supports a 32768-token context length.
Loading preview...