Asib1/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_leggy_ant
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Apr 28, 2025Architecture:Transformer Warm

Asib1/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_leggy_ant is a 0.5 billion parameter instruction-tuned language model, fine-tuned from unsloth/Qwen2.5-0.5B-Instruct. It was trained using the TRL library and incorporates the GRPO method, which is designed to enhance mathematical reasoning. This model is suitable for tasks requiring instruction following and potentially benefits from improved mathematical capabilities due to its training methodology, supporting a context length of 131072 tokens.

Loading preview...