Overview
ilgee/Multiclass-Think-RM-8B is an 8-billion-parameter generative reward model fine-tuned from Llama-3.1-8B-Instruct. Developed by Ilgee Hong et al., it introduces an internal thinking process that enables long-horizon reasoning before a preference judgment is produced. This distinguishes it both from traditional Bradley-Terry reward models and from generative reward models limited to shallow chain-of-thought.
Key Capabilities
- Long-horizon reasoning: Employs an internal deliberation mechanism for complex tasks.
- Multiclass preference output: Produces a granular score from -3 (Assistant A much better) to +3 (Assistant B much better), capturing not just the direction but the strength of the preference.
- Interpretable reasoning trajectories: The internal thinking process can lead to more understandable evaluation paths.
- Strong performance: Designed to perform well on out-of-distribution and reasoning-heavy benchmarks, as detailed in the accompanying paper Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models.
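To make the multiclass output concrete, here is a minimal sketch of how a pairwise judging prompt might be assembled and the final -3..+3 score extracted from the model's generation. The prompt template and the score-extraction heuristic are assumptions for illustration; the model card does not specify the exact chat template, so consult the accompanying paper or repository for the canonical format.

```python
import re

def build_judge_prompt(question: str, answer_a: str, answer_b: str) -> str:
    # Hypothetical prompt layout -- the actual template used for
    # Multiclass-Think-RM-8B may differ.
    return (
        "Compare the two assistant responses to the user question below.\n\n"
        f"[Question]\n{question}\n\n"
        f"[Assistant A]\n{answer_a}\n\n"
        f"[Assistant B]\n{answer_b}\n\n"
        "Think step by step, then output a final integer score from "
        "-3 (Assistant A much better) to 3 (Assistant B much better)."
    )

def parse_preference(generation: str) -> int:
    """Extract the last integer in [-3, 3] from the model's generation,
    skipping any numbers that appear earlier in the reasoning trace."""
    for token in reversed(re.findall(r"-?\d+", generation)):
        score = int(token)
        if -3 <= score <= 3:
            return score
    raise ValueError("no valid preference score found in generation")
```

In practice, `build_judge_prompt` would be passed through the model's chat template and generated with `transformers` (e.g. `AutoModelForCausalLM.from_pretrained("ilgee/Multiclass-Think-RM-8B")`), and `parse_preference` applied to the decoded output after the thinking section.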
Good for
- Evaluating and comparing AI assistant responses in complex, reasoning-intensive scenarios.
- Applications requiring nuanced and interpretable preference judgments.
- Research into advanced reward modeling techniques and long-horizon reasoning in LLMs.