yaoyueduzhen/RAG-R1-mq-7b
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Jul 3, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm
RAG-R1-mq-7b is a RAG (Retrieval Augmented Generation) model developed by Zhiwen Tan, Jiaming Huang, Qintong Wu, Hongxuan Zhang, Chenyi Zhuang, and Jinjie Gu. This framework enhances LLMs by enabling adaptive use of internal and external knowledge through multi-query parallelism. It is designed to reduce inference time and improve reasoning capabilities, outperforming baselines by up to 13.2% on QA benchmarks while decreasing inference time by 11.1%.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–