rubricreward/R3-Qwen3-4B-14k
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 14, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

rubricreward/R3-Qwen3-4B-14k is a 4 billion parameter reward model, fine-tuned from Qwen/Qwen3-4B, developed by rubricreward. It is part of the R3 family of Robust Rubric-Agnostic Reward Models, trained on a diverse dataset covering tasks like classification, preference optimization, and question answering. This model specializes in evaluating responses based on provided rubrics, scores, and reasoning, making it suitable for automated assessment and feedback generation.

Loading preview...