infly/INFLogic-Qwen2.5-32B-RL-Preview
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Mar 27, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold
The infly/INFLogic-Qwen2.5-32B-RL-Preview is a 32.8 billion parameter language model developed by infly, fine-tuned from DeepSeek-R1-Distill-Qwen-32B. It specializes in logical reasoning tasks, achieving state-of-the-art performance among open-source LLMs on the ZebraLogicBench. This model leverages reinforcement learning with verifiable rewards (RLVR) on a proprietary logical reasoning dataset to enhance its problem-solving capabilities, making it suitable for complex analytical applications.
Loading preview...