PatronusAI/Llama-3-Patronus-Lynx-8B-Instruct
PatronusAI/Llama-3-Patronus-Lynx-8B-Instruct is an 8-billion-parameter instruction-tuned model developed by Patronus AI and fine-tuned from Meta-Llama-3-8B-Instruct. It is designed and trained for hallucination detection in Retrieval Augmented Generation (RAG) settings, evaluating whether an answer is faithful to the provided documents. The model identifies answers that introduce information not present in the document or that contradict it, and supports a maximum sequence length of 8000 tokens.
PatronusAI/Llama-3-Patronus-Lynx-8B-Instruct Overview
PatronusAI/Llama-3-Patronus-Lynx-8B-Instruct, developed by Patronus AI, is an 8 billion parameter model fine-tuned from Meta-Llama-3-8B-Instruct. Its primary purpose is to serve as an open-source hallucination evaluation model, specifically trained to detect unfaithfulness in answers generated within Retrieval Augmented Generation (RAG) contexts. The model was trained on a diverse mix of datasets, including CovidQA, PubmedQA, DROP, and RAGTruth, incorporating both hand-annotated and synthetic data.
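The checkpoint can be loaded with the standard Hugging Face transformers API. The following is a minimal sketch, assuming a GPU with enough memory to hold an 8B model in bfloat16; only the model ID comes from this page.

```python
# Minimal loading sketch using the standard transformers API (assumed setup).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "PatronusAI/Llama-3-Patronus-Lynx-8B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # ~16 GB of weights; fits a single 24 GB GPU
    device_map="auto",
)
```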
Key Capabilities
- Hallucination Detection: Evaluates whether a given answer is faithful to a provided document and question.
- Faithfulness Scoring: Determines whether an answer introduces new information not present in the document or contradicts information within it, outputting a 'PASS' or 'FAIL' score.
- Reasoning Generation: Provides detailed reasoning for its faithfulness verdict, returned in JSON format alongside the score (see the sketch after this list).
- Context Length: Supports a maximum sequence length of 8000 tokens.
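The sketch below shows how these capabilities fit together: a question, document, and answer are formatted into an evaluation prompt, and the model's JSON verdict is parsed. The prompt wording here is an illustrative paraphrase of the task described above, not the model's canonical template (consult the official model card for that), and evaluate_faithfulness is a hypothetical helper that reuses the model and tokenizer loaded earlier.

```python
import json

# Illustrative evaluation prompt paraphrasing the task described above;
# the exact canonical template is published on the official model card.
PROMPT = """Given the following QUESTION, DOCUMENT and ANSWER, determine whether \
the ANSWER is faithful to the DOCUMENT. The ANSWER must not introduce information \
beyond the DOCUMENT and must not contradict it.

QUESTION:
{question}

DOCUMENT:
{document}

ANSWER:
{answer}

Output JSON with the keys "REASONING" and "SCORE" (PASS or FAIL)."""

def evaluate_faithfulness(question: str, document: str, answer: str) -> dict:
    """Hypothetical helper: run one faithfulness check and parse the verdict."""
    messages = [{"role": "user", "content": PROMPT.format(
        question=question, document=document, answer=answer)}]
    input_ids = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    output_ids = model.generate(input_ids, max_new_tokens=512, do_sample=False)
    completion = tokenizer.decode(
        output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True
    )
    return json.loads(completion)  # e.g. {"REASONING": [...], "SCORE": "PASS"}

verdict = evaluate_faithfulness(
    question="What is the boiling point of water at sea level?",
    document="Water boils at 100 degrees Celsius at sea level.",
    answer="Water boils at 100 degrees Celsius at sea level.",
)
print(verdict["SCORE"])  # a faithful answer should score PASS
```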
Performance and Use Cases
Evaluated on the PatronusAI/HaluBench dataset, the 8B LYNX model demonstrates strong hallucination detection performance across the benchmark's constituent tasks, achieving an 82.9% overall score. This makes it a robust tool for developers and researchers working to improve the reliability and factual accuracy of RAG systems, and it is particularly useful for automated quality assurance of LLM outputs against source documents.
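For automated quality assurance of a RAG pipeline, the same helper can screen batches of generated answers against their source documents. A short sketch, where rag_records is a hypothetical structure standing in for your pipeline's outputs:

```python
# Hypothetical QA loop over RAG outputs; `rag_records` is illustrative and
# not part of the model's API. Reuses evaluate_faithfulness from above.
rag_records = [
    {
        "question": "When was the company founded?",
        "document": "The company was founded in 1998 in Menlo Park.",
        "answer": "The company was founded in 2001.",
    },
]

for record in rag_records:
    verdict = evaluate_faithfulness(**record)
    if verdict["SCORE"] == "FAIL":
        print("Possible hallucination:", verdict["REASONING"])
```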