Name: AIJian/PaTaRM-14B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: AIJian

PaTaRM-14B: Preference-Aware Task-Adaptive Reward Modeling

PaTaRM-14B is a 14 billion parameter model from the PaTaRM series, developed by Ai Jian. It is built upon the Qwen3-14B base architecture and features a substantial context length of 32768 tokens. The core innovation behind PaTaRM models lies in their approach to "Preference-Aware Task-Adaptive Reward Modeling," which aims to effectively integrate both pairwise and pointwise signals during training.

Key Capabilities

Preference-Aware Learning: Designed to understand and leverage user preferences more effectively by bridging different types of feedback signals.
Task-Adaptive Reward Modeling: Incorporates a mechanism for reward modeling that adapts to specific tasks, potentially leading to more aligned and performant outputs.
Qwen3 Base: Benefits from the robust capabilities and architectural strengths of the Qwen3 foundation model.

Good for

Applications requiring sophisticated preference learning and reward-based optimization.
Research and development in advanced language model alignment techniques.
Tasks where understanding nuanced user feedback is critical for performance improvement.

Overview

PaTaRM-14B: Preference-Aware Task-Adaptive Reward Modeling

Key Capabilities

Good for

Full Model Card (README)