rubricreward/mR3-Qwen3-4B-en-prompt-en-thinking
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Sep 19, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm
rubricreward/mR3-Qwen3-4B-en-prompt-en-thinking is a 4 billion parameter reward model, part of the mR3 (Multilingual Rubric-Agnostic Reward Reasoning Models) family, fine-tuned from Qwen/Qwen3-4B. It is specifically designed for evaluating and reasoning about AI responses across 72 languages, covering tasks like classification, preference optimization, and question answering. This model excels at providing scores and detailed reasoning based on evaluation rubrics, making it suitable for automated content moderation and quality assessment.
Loading preview...