rubricreward/mR3-Qwen3-8B-en-prompt-en-thinking
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Sep 19, 2025License:apache-2.0Architecture:Transformer Open Weights Cold

rubricreward/mR3-Qwen3-8B-en-prompt-en-thinking is an 8 billion parameter reward model, part of the mR3 (Multilingual Rubric-Agnostic Reward Reasoning Models) family, fine-tuned from Qwen3-8B. It is specifically designed for evaluating responses based on detailed rubrics and reasoning, trained on a curated dataset covering 72 languages for tasks like classification, preference optimization, and question answering. This model excels at providing scores and explanations for assistant responses, making it ideal for automated content evaluation and quality control in multilingual contexts.

Loading preview...