launch/ThinkPRM-1.5B

Name: launch/ThinkPRM-1.5B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: launch

Warm

Public

Model Size: 1.5B

Quant: BF16

Ctx length: 32768

Concurrency cost: 1

Published on: Apr 25, 2025

License: apache-2.0

Hugging Face

ThinkPRM-1.5B by launch is a 1.5 billion parameter Process Reward Model (PRM) based on the R1-Distill-Qwen-1.5B architecture, designed for step-by-step verification of reasoning processes. It generates explicit verification chain-of-thought (CoT) by labeling each step, requiring significantly less supervision data than traditional discriminative PRMs. This model excels at providing step-level verification scores and critiques for solutions in mathematical reasoning, scientific QA, and code generation tasks, with a notable context length of 131072 tokens.

No reviews yet. Be the first to review!