OpenPipe/Deductive-Reasoning-Qwen-14B

Text Generation · Concurrency Cost: 1 · Model Size: 14.8B · Quant: FP8 · Ctx Length: 32k · Published: Mar 6, 2025 · License: MIT · Architecture: Transformer · Open Weights

OpenPipe/Deductive-Reasoning-Qwen-14B is a 14.8 billion parameter language model, fine-tuned from Qwen2.5-14B-Instruct and optimized specifically for solving complex deductive reasoning problems. The model was trained with reinforcement learning on the Temporal Clue dataset to strengthen its logical inference capabilities. With a context length of 131,072 tokens, it excels at tasks requiring structured deduction and problem-solving.


Model Overview

Deductive-Reasoning-Qwen-14B is a 14.8 billion parameter language model developed by OpenPipe. It is a reinforcement learning fine-tune of the Qwen2.5-14B-Instruct base model, engineered specifically to tackle challenging deduction problems. Training used the Temporal Clue dataset, with a focus on improving the model's ability to perform complex logical inference.
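Because the model is a fine-tune of Qwen2.5-14B-Instruct, it can be loaded and prompted like any other instruct-tuned checkpoint on Hugging Face. The sketch below is a minimal, hedged example using the `transformers` library: it assumes the standard Qwen-style chat template shipped with the tokenizer, bf16 weights that fit your hardware, and an illustrative deduction puzzle that is not taken from the Temporal Clue dataset.

```python
# Minimal sketch: load the model and ask it a simple deduction puzzle.
# Assumptions: sufficient GPU memory for a 14.8B model in bf16, and the
# chat template inherited from Qwen2.5-14B-Instruct.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "OpenPipe/Deductive-Reasoning-Qwen-14B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumption: bf16 fits your hardware
    device_map="auto",
)

# Illustrative deduction puzzle (hypothetical, not from Temporal Clue).
messages = [
    {
        "role": "user",
        "content": (
            "Three suspects made statements. Alice: 'Bob did it.' "
            "Bob: 'I did not do it.' Carol: 'I did not do it.' "
            "Exactly one statement is true. Who did it? Explain your reasoning."
        ),
    }
]

inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
# Strip the prompt tokens and print only the model's answer.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

For higher-throughput or longer-context serving, the same checkpoint can be deployed behind an OpenAI-compatible inference server; the prompt format stays the same because it follows the base model's chat template.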

Key Capabilities

  • Enhanced Deductive Reasoning: Specialized training on the Temporal Clue dataset significantly boosts its performance in solving intricate deduction puzzles.
  • Reinforcement Learning Optimization: Benefits from reinforcement fine-tuning, a method known for improving model performance on specific, complex tasks.
  • Multilingual Support: Inherits multilingual capabilities from its Qwen2.5-14B-Instruct base, supporting languages including Chinese, English, French, and Spanish, among others.
  • Large Context Window: Features a substantial context length of 131,072 tokens, enabling it to process and reason over extensive inputs.

Good For

  • Applications requiring strong logical deduction and problem-solving skills.
  • Tasks involving complex inference from given clues or statements.
  • Research and development in reinforcement learning for language models.