ryokamoi/Llama-3.1-8B-FoVer-PRM-2026: Enhanced Reasoning with Formal Verification
This model is an 8 billion parameter Process Reward Model (PRM) built on the Llama-3.1 architecture, developed by Ryo Kamoi and the PSU NLP Group. Its distinguishing feature is its training methodology: the FoVer framework synthesizes PRM training data by using formal verification tools (such as Z3 and Isabelle) to annotate step-level errors in reasoning traces, yielding high-quality supervision without human annotation or repeated LLM calls.
Key Capabilities & Differentiators
- Efficient PRM Data Synthesis: FoVer automates the creation of PRM training data from formal reasoning tasks, significantly reducing cost and noise compared to traditional human annotation or sampling-based methods.
- Improved Reasoning Performance: Experiments across 12 reasoning benchmarks demonstrate that fine-tuning with FoVer-generated data enhances PRM performance not only on math and logic tasks (informal variants of the training data) but also on NLI and Big Bench Hard (BBH) benchmarks.
- Step-Level Error Supervision: The model is designed to provide precise step-level error labels, crucial for improving the reasoning capabilities of large language models.
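To make step-level supervision concrete, the sketch below shows one way a problem and its reasoning steps might be packed into a single prompt for a PRM, plus the outline of a scoring call. The prompt template, step separator, and judgment-token scheme are illustrative assumptions, not this model's documented interface; consult the model card and the FoVer repository for the exact input format.

```python
# Illustrative sketch of preparing input for step-level PRM scoring.
# The "Problem:"/"Step N:" template and the idea of comparing
# judgment-token logits are assumptions for illustration only.
from typing import List


def format_prm_input(problem: str, steps: List[str]) -> str:
    """Join a problem and its reasoning steps into one prompt,
    one numbered step per line (an assumed convention)."""
    lines = [f"Problem: {problem}"]
    for i, step in enumerate(steps, start=1):
        lines.append(f"Step {i}: {step}")
    return "\n".join(lines)


def score_steps(model, tokenizer, problem: str, steps: List[str]):
    """Sketch of a scoring pass: run the PRM as a causal LM and
    inspect its logits at each step boundary (hypothetical scheme)."""
    import torch

    text = format_prm_input(problem, steps)
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (1, seq_len, vocab)
    # ...locate each step's final token and compare the logits of the
    # model's correct/incorrect judgment tokens to get per-step labels.
    return logits


if __name__ == "__main__":
    prompt = format_prm_input(
        "What is 2 + 3 * 4?",
        ["Multiply first: 3 * 4 = 12.", "Add: 2 + 12 = 14."],
    )
    print(prompt)
```

In practice the model and tokenizer would be loaded with `transformers` (e.g. `AutoModelForCausalLM` / `AutoTokenizer`) using this repository's ID; the formatting helper above is separated out so the prompt-construction step can be tested without downloading the 8B weights.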
Ideal Use Cases
- Enhancing LLM Reasoning: Developers looking to improve the logical and mathematical reasoning abilities of their LLMs.
- Automated Feedback Systems: Applications requiring automated, accurate feedback on multi-step reasoning processes.
- Research in Formal Verification & LLMs: Researchers exploring the intersection of formal methods and large language models for robust AI development.
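As a concrete example of the feedback-system use case above, step-level PRM scores are commonly aggregated into a solution-level score (for instance, by taking the minimum step score) to rerank sampled candidate solutions. The sketch below uses made-up scores and a minimum-aggregation convention; the aggregation rule is an assumption, not part of this model's specification.

```python
# Minimal sketch of PRM-based best-of-N reranking. Each candidate
# solution carries per-step correctness probabilities (here, made-up
# illustration data); we score a solution by its weakest step and
# pick the candidate with the highest such score.
from typing import Dict, List


def aggregate(step_scores: List[float]) -> float:
    """Solution-level score = the weakest step's score (min aggregation,
    one common convention for PRMs)."""
    return min(step_scores)


def rerank(candidates: Dict[str, List[float]]) -> str:
    """Return the candidate whose weakest step the PRM trusts most."""
    return max(candidates, key=lambda name: aggregate(candidates[name]))


candidates = {
    "solution_a": [0.95, 0.40, 0.90],  # one shaky step drags it down
    "solution_b": [0.80, 0.85, 0.88],  # uniformly solid
}
print(rerank(candidates))  # → solution_b
```

Min aggregation reflects the intuition that a chain of reasoning is only as reliable as its weakest step; alternatives such as the product or mean of step scores are also used in the PRM literature.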