maximalists/BRAG-Llama-3.1-8b-v0.1
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jul 27, 2024License:llama3.1Architecture:Transformer0.0K Cold

BRAG-Llama-3.1-8b-v0.1 is an 8 billion parameter Small Language Model (SLM) developed by maximalists, specifically fine-tuned for Retrieval-Augmented Generation (RAG) tasks. It excels at RAG with both tables and text, as well as conversational chat, supporting a context length of up to 128k tokens. This model is optimized for English RAG applications, offering strong performance in its category.

Loading preview...