OpenRubrics/RubricARROW-8B-Rubric
OpenRubrics/RubricARROW-8B-Rubric is an 8 billion parameter language model developed by OpenRubrics, fine-tuned from Qwen3/Qwen3-8B. This model specializes in generating rubric-style instructions for evaluating LLM responses in non-verifiable domains. It is designed to extract universal principles and hard rules from user requests, ensuring comprehensive, concise, and distinct evaluation criteria. The model's primary application is to facilitate robust post-training and evaluation of large language models.
Loading preview...
OpenRubrics/RubricARROW-8B-Rubric Overview
RubricARROW-8B-Rubric is an 8 billion parameter model developed by OpenRubrics, fine-tuned from the Qwen3/Qwen3-8B architecture. Its core function is to generate detailed, rubric-style evaluation criteria from user requests, specifically for assessing LLM responses in domains where verification is challenging. This model was introduced in the paper "RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains" 2605.29156.
Key Capabilities
- Rubric Generation: Automatically extracts evaluation criteria from natural language requests.
- Categorization: Distinguishes between "Hard Rules" (explicit requirements) and "Principles" (abstracted quality criteria).
- Universality: Ensures rubric items are universal principles, free from topic-specific references.
- Comprehensiveness: Covers all critical aspects, including explicit requirements and implicit quality standards.
- Conciseness & Uniqueness: Generates distinct, non-redundant evaluation criteria.
- Structured Output: Formats rubrics as a numbered list, with each item starting "The response" and appending
[Hard Rule]or[Principle].
Ideal Use Cases
- LLM Evaluation: Generating objective evaluation rubrics for assessing the quality of LLM outputs.
- Automated Feedback Systems: Creating structured feedback mechanisms for language models.
- Research in LLM Post-training: Aiding in the development and evaluation of reward models for non-verifiable tasks.