RuleReasoner/RuleReasoner-4B

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Jun 6, 2025License:mitArchitecture:Transformer0.0K Open Weights Warm

RuleReasoner-4B is a 4 billion parameter language model developed by Yang Liu, Jiaqi Li, and Zilong Zheng. This model is specifically designed for reinforced rule-based reasoning, leveraging domain-aware dynamic sampling. It focuses on enhancing reasoning capabilities, making it suitable for tasks requiring logical inference and structured problem-solving. The model has a context length of 32768 tokens.

Loading preview...

RuleReasoner-4B Overview

RuleReasoner-4B is a 4 billion parameter language model developed by Yang Liu, Jiaqi Li, and Zilong Zheng, as detailed in their 2025 paper "RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling." This model introduces a novel approach to reasoning by integrating reinforced rule-based mechanisms with domain-aware dynamic sampling.

Key Capabilities

  • Reinforced Rule-based Reasoning: Designed to excel in tasks that require adherence to specific rules and logical inference.
  • Domain-aware Dynamic Sampling: Utilizes a sophisticated sampling strategy that adapts to the specific domain of the problem, enhancing reasoning accuracy and efficiency.
  • Structured Problem Solving: Optimized for scenarios where structured, logical thinking is paramount.

Good For

  • Applications requiring robust logical reasoning and rule adherence.
  • Research into advanced reasoning techniques in language models.
  • Tasks benefiting from domain-specific inference and dynamic adaptation.