Jaew00Lee/Qwen3-4B-PRInTS
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Dec 10, 2025License:mitArchitecture:Transformer Open Weights Warm

Jaew00Lee/Qwen3-4B-PRInTS is a 4 billion parameter Qwen3-based generative process reward model developed by Jaewoo Lee, Archiki Prasad, Justin Chih-Yao Chen, Zaid Khan, Elias Stengel-Eskin, and Mohit Bansal. It is specifically fine-tuned for long-horizon information-seeking tasks, excelling at evaluating agent trajectory steps and recursively summarizing context. The model's primary strength lies in providing fine-grained guidance for information-seeking agents by scoring candidate next steps and maintaining a compact information-seeking trajectory summary within its 40960 token context window.

Loading preview...