Name: OrionLLM/GRM-2.5 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: OrionLLM

GRM-2.5: A Compact Reasoning Model for Local AI

GRM-2.5 is a 4.5 billion parameter model from OrionLLM, leveraging the Qwen3.5 architecture. It is specifically optimized for structured reasoning and efficient local deployment, making it a powerful solution for general-purpose AI on consumer hardware.

Key Capabilities

Strong Reasoning: Handles both everyday conversations and complex reasoning tasks with clarity and consistency.
Efficient Local Coding and Agentic Use: Well-suited for code generation, structured problem-solving, and local agent-style workflows despite its compact size.
Optimized for Local Deployment: Designed for accessible inference across a broad range of hardware, prioritizing practical usability.

Performance Highlights

GRM-2.5 demonstrates strong performance in various benchmarks, including:

MMLU-Pro: Achieves 80.1, indicating robust knowledge and STEM capabilities.
IFEval: Scores 90.2 for instruction following.
LiveCodeBench v6: Reaches 56.9, showcasing its coding proficiency.
TAU2-Bench: Scores 80.2 for agentic tasks.

Good For

Developers seeking a capable AI model for local inference on consumer hardware.
Applications requiring strong structured reasoning and problem-solving abilities.
Use cases involving code generation and agent-style workflows.
Scenarios where a balance between performance and efficiency is crucial.

Overview

GRM-2.5: A Compact Reasoning Model for Local AI

Key Capabilities

Performance Highlights

Good For

Full Model Card (README)