ICTNLP/Auto-RAG-Llama-3-8B-Instruct

Text Generation · Concurrency Cost: 1 · Model Size: 8B · Quant: FP8 · Context Length: 8k · Published: Nov 29, 2024 · License: apache-2.0 · Architecture: Transformer

Auto-RAG: Autonomous Retrieval-Augmented Generation

ICTNLP/Auto-RAG-Llama-3-8B-Instruct is an 8 billion parameter language model developed by the ICTNLP Group, designed for autonomous retrieval-augmented generation (RAG). It is fine-tuned from Meta-Llama-3-8B-Instruct on synthesized iterative retrieval instruction data. The goal is to improve large language models on tasks that require dynamic, multi-step information retrieval.

Key Capabilities

  • Autonomous Retrieval-Augmented Generation: Optimized for tasks where the model needs to autonomously retrieve relevant information and integrate it into its responses.
  • Iterative Retrieval: Trained with data that simulates iterative retrieval processes, allowing for more sophisticated information gathering.
  • Extended Context Window: Supports an 8192-token context length, enabling the processing of longer documents and more complex queries.
  • Research-Backed: Developed by the ICTNLP Group, with details available in their paper "Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models" (arXiv:2411.19443).
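The iterative retrieval idea above can be sketched as a simple control loop: retrieve, let the model decide whether it has enough evidence to answer, and if not, let it issue a refined query. The sketch below stubs out both the retriever and the model's decision step with toy functions; in the actual Auto-RAG setup, the fine-tuned model itself produces the follow-up queries and the final answer.

```python
# Illustrative sketch of an autonomous iterative-retrieval loop (Auto-RAG style).
# `retrieve` and `model_step` are toy stand-ins, not the real retriever or model.

def retrieve(query, corpus):
    """Toy retriever: return passages sharing at least one word with the query."""
    terms = set(query.lower().split())
    return [p for p in corpus if terms & set(p.lower().split())]

def auto_rag(question, corpus, model_step, max_rounds=3):
    """Iterate: retrieve, then let the model answer or emit a refined query."""
    query, evidence = question, []
    for _ in range(max_rounds):
        evidence += retrieve(query, corpus)
        decision = model_step(question, evidence)  # stand-in for the LLM call
        if decision["final"]:
            return decision["answer"]
        query = decision["next_query"]  # model chose to keep retrieving
    return "insufficient evidence"

# Stub "model": answers once a passage mentions Paris, otherwise refines the query.
def model_step(question, evidence):
    if any("Paris" in p for p in evidence):
        return {"final": True, "answer": "Paris"}
    return {"final": False, "next_query": "capital France"}

corpus = ["The capital of France is Paris.", "Berlin is in Germany."]
print(auto_rag("What is the capital of France?", corpus, model_step))  # → Paris
```

The loop terminates either when the model declares its evidence sufficient or after `max_rounds` retrievals, which bounds latency in the worst case.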

Use Cases

This model is particularly well-suited for applications requiring advanced RAG functionalities, such as:

  • Complex question answering systems that need to consult external knowledge bases.
  • Information synthesis from multiple sources.
  • Building intelligent agents that can autonomously gather and process information.

Developers can deploy this model with vLLM for efficient inference.
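As one possible deployment path (a sketch, assuming a recent vLLM release that provides the `vllm serve` command and a GPU with enough memory for an 8B model), the model can be served behind vLLM's OpenAI-compatible API and queried over HTTP:

```shell
# Serve the model behind an OpenAI-compatible endpoint on port 8000
vllm serve ICTNLP/Auto-RAG-Llama-3-8B-Instruct --max-model-len 8192

# From another shell, send a chat request
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "ICTNLP/Auto-RAG-Llama-3-8B-Instruct",
        "messages": [{"role": "user", "content": "Who wrote Hamlet?"}]
      }'
```

Because the endpoint is OpenAI-compatible, any OpenAI client library can be pointed at `http://localhost:8000/v1` without code changes.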