Name: PatronusAI/glider API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: PatronusAI

Overview

Patronus GLIDER is a 4 billion parameter model developed by Patronus AI, fine-tuned from the microsoft/Phi-3.5-mini-instruct architecture. Its core purpose is to serve as a versatile evaluation model, capable of judging the quality and adherence to criteria of various text-based outputs, including conversations and Retrieval-Augmented Generation (RAG) systems.

Key Capabilities

General-Purpose Evaluation: GLIDER can assess texts, conversations, and RAG outputs against arbitrary, user-defined criteria and rubric scales.
Domain Adaptation: Trained on a combination of synthetic and domain-adapted data from datasets like Mocha, FinQA, and Realtoxicity, covering over 183 metrics and 685 domains (e.g., finance, medicine).
Multilingual Support: Primarily English, but also supports numerous other languages including Korean, Kazakh, Hindi, Bengali, Spanish, Indonesian, German, French, Arabic, Russian, Thai, Turkish, Ukrainian, and Romanian.
Extended Context: While the maximum sequence length is 8192 tokens, the model has been tested to support longer texts, up to 12,000 tokens.
Explainable Scoring: Designed to provide detailed reasoning, highlight important phrases, and assign an integer score based on a provided rubric.

Good For

Automated Content Moderation: Evaluating text against specific guidelines or toxicity metrics.
RAG System Assessment: Judging the relevance and accuracy of retrieved contexts and generated responses.
Conversational AI Quality Assurance: Scoring dialogue coherence, helpfulness, or adherence to persona.
Custom Evaluation Tasks: Users can define their own pass_criteria and rubric to tailor the model's evaluation to specific needs, making it highly adaptable for various quality control and assessment scenarios.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)