oxtie/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hardy_feathered_anaconda
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Apr 10, 2025Architecture:Transformer Cold

The oxtie/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hardy_feathered_anaconda is a 0.5 billion parameter instruction-tuned language model, fine-tuned from Gensyn/Qwen2.5-0.5B-Instruct. It was trained using the TRL framework and incorporates the GRPO method, originally introduced for mathematical reasoning in large language models. This model is optimized for instruction-following tasks, leveraging its fine-tuning to provide coherent and relevant responses.

Loading preview...