karansharma1994/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tropical_quick_butterfly
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kArchitecture:Transformer Warm

This model is a 0.5 billion parameter instruction-tuned causal language model, fine-tuned by karansharma1994 from the Gensyn/Qwen2.5-0.5B-Instruct base. It was trained using the TRL framework and incorporates the GRPO method, which is known for pushing the limits of mathematical reasoning. This model is designed for general instruction-following tasks, leveraging its specialized training for potentially enhanced reasoning capabilities.

Loading preview...