RAG-R1-mq-7b is a RAG (Retrieval Augmented Generation) model developed by Zhiwen Tan, Jiaming Huang, Qintong Wu, Hongxuan Zhang, Chenyi Zhuang, and Jinjie Gu. This framework enhances LLMs by enabling adaptive use of internal and external knowledge through multi-query parallelism. It is designed to reduce inference time and improve reasoning capabilities, outperforming baselines by up to 13.2% on QA benchmarks while decreasing inference time by 11.1%.
No reviews yet. Be the first to review!