mntunur/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-carnivorous_peckish_crab
Text Generation · Concurrency Cost: 1 · Model Size: 0.5B · Quant: BF16 · Ctx Length: 32k · Published: Apr 26, 2025 · Architecture: Transformer · Warm

mntunur/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-carnivorous_peckish_crab is an instruction-following language model fine-tuned from Qwen2.5-0.5B-Instruct. Developed by mntunur, it was trained with the TRL framework using GRPO (Group Relative Policy Optimization), a reinforcement learning method introduced to improve mathematical reasoning in language models. Its primary use is instruction-following tasks, which is where the GRPO training is meant to help.
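To make the GRPO reference concrete, the sketch below shows the group-relative advantage step at the core of the method: several completions are sampled for one prompt, each is scored, and every reward is normalized against the mean and standard deviation of its own group. This is an illustration only, not this model's training code. The function name `group_relative_advantages`, the `eps` value, the use of the population standard deviation, and the sample rewards are all assumptions made for the example.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-4):
    """Normalize each reward against its own group (hypothetical helper).

    rewards: scores for several completions sampled from the same prompt.
    eps: small constant (assumed value) that avoids division by zero
         when every completion in the group gets the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption for this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four completions of one prompt:
# 1.0 means the answer was judged correct, 0.0 means incorrect.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Completions that score above their group's mean get a positive advantage and are reinforced, while those below the mean get a negative one. Because the baseline comes from the group itself, this step needs no separate value model.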
