OpenPipe/Deductive-Reasoning-Qwen-32B is a 32.8 billion parameter language model based on Qwen 2.5 32B Instruct, fine-tuned using reinforcement learning by OpenPipe. This model is specifically optimized to solve complex deductive reasoning problems, leveraging the Temporal Clue dataset. With a context length of 131072 tokens, it excels at tasks requiring logical inference and problem-solving.
No reviews yet. Be the first to review!