tafariji/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-bellowing_invisible_ocelot
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kArchitecture:Transformer Warm

The tafariji/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-bellowing_invisible_ocelot model is a 0.5 billion parameter instruction-tuned language model, fine-tuned from Gensyn/Qwen2.5-0.5B-Instruct. It was trained using the TRL framework and incorporates the GRPO method, which is designed to enhance mathematical reasoning. This model is particularly suited for tasks requiring improved reasoning capabilities, especially in mathematical contexts.

Loading preview...