Surromind/RetrievalLLM-preview
TEXT GENERATIONConcurrency Cost:1Model Size:14.8BQuant:FP8Ctx Length:32kPublished:Mar 21, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Surromind/RetrievalLLM-preview is a 14.8 billion parameter Qwen2.5-based model fine-tuned by Surromind for Retrieval Augmented Generation (RAG) tasks. It excels at generating accurate answers with explicit source citations in a structured JSON format, making it ideal for applications requiring grounded responses from provided documents. The model was trained on a specialized dataset including RAG, CoT, and benchmark data, focusing on precise information retrieval and structured output.

Loading preview...