Starling-LM-7B-alpha
ctx-bird-reward-250121
Llama-3.1-Nemotron-70B-Reward-HF
Qwen3-Nemotron-8B-BRRM
Qwen3-Nemotron-14B-BRRM
RewardAnything-8B-v1
Storm-7B
Starling-LM-7B-beta
ToolRM-Gen-Qwen3-4B-Thinking-2507
karma-electric-llama31-8b
ThinkPRM-1.5B
R-PRM-7B-DPO
IF-Verifier-7B
ThinkPRM-14B
Qwen2.5-Math-1.5B-Scoring-Mean
SOLE-R1-8B
PaTaRM-8B
PaTaRM-14B
sycofact
JSL-MedMNX-7B-v2.0
Starling-LM-7B-beta-laser-dpo
UniRRM-8B
ThinkPRM-7B
WebArbiter-7B
IntelliAsk-Qwen3-32B-450-Merged
WebArbiter-3B
SpatialReward-8B
WebArbiter-8B-Qwen3
gPRM-14B-merged
WebArbiter-4B-Qwen3
Llama-3.1-8B-FoVer-PRM-2026
SciRM-7B
Llama-3.1-8B-FoVer-PRM-old
SciRM-Ref-7B
Multiclass-Think-RM-8B
Qwen-2.5-7B-FoVer-PRM-2026
gORM-14B-merged