DeepRetrieval/DeepRetrieval-PubMed-3B-Llama
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Mar 31, 2025License:mitArchitecture:Transformer0.0K Open Weights Warm

DeepRetrieval/DeepRetrieval-PubMed-3B-Llama is a 3.2 billion parameter Llama-based model developed by DeepRetrieval, specifically trained using a novel reinforcement learning approach for query generation. This model excels at optimizing query generation for retrieval tasks without requiring supervised data, learning through trial and error with retrieval metrics as rewards. It is designed to hack real search engines and retrievers, offering state-of-the-art performance across diverse retrieval scenarios. Its primary strength lies in its ability to generate effective queries for information retrieval, making it suitable for applications requiring robust search capabilities.

Loading preview...