SouravCrypto/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-striped_tawny_dove
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Apr 21, 2025Architecture:Transformer Warm
SouravCrypto/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-striped_tawny_dove is a fine-tuned instruction-following language model based on Gensyn's Qwen2.5-0.5B-Instruct. This model was trained using the GRPO method, which is designed to enhance mathematical reasoning capabilities, as introduced in the DeepSeekMath paper. Its primary use case is to provide instruction-tuned responses, potentially with improved reasoning, building upon its 0.5 billion parameter base.
Loading preview...