Leto, A., Aguerrebere, C., Bhati, I., Willke, T., Tepper, M., & Vo, V. A. (2024). Toward Optimal Search and Retrieval for RAG. arXiv preprint arXiv:2411.07396.
This research paper investigates the impact of retrieval optimization on the performance of Retrieval-Augmented Generation (RAG) pipelines for Question Answering (QA) and attributed QA tasks. The authors aim to understand how different retrieval parameters, such as the number of retrieved documents and the accuracy of approximate nearest neighbor search, affect the accuracy and citation quality of RAG systems.
The authors evaluate two open-source dense retrieval models, BGE-base and ColBERTv2, with two instruction-tuned LLMs, LLaMA and Mistral, on three benchmark QA datasets: ASQA, QAMPARI, and Natural Questions (NQ). They experiment with varying numbers of retrieved documents (k) and different levels of approximate nearest neighbor (ANN) search accuracy. The performance is measured using exact match recall (EM Rec.) for QA correctness, and citation recall and precision for attributed QA.
Optimizing retrieval for a higher gold document recall is crucial for maximizing RAG performance in QA tasks. While approximate nearest neighbor search offers speed and efficiency advantages, its accuracy should be balanced against potential drops in gold document recall. Contrary to previous findings, injecting noisy documents does not appear to benefit RAG performance.
This research provides valuable insights for practitioners developing RAG pipelines for QA. It highlights the importance of gold document retrieval and suggests that approximate search can be effectively used without major performance loss. The findings on noisy documents challenge previous assumptions and call for further investigation.
The study is limited to a single dense retriever for evaluating the impact of approximate vs. exact search. Future work should explore the generalizability of these findings with multi-vector retrievers and in end-to-end trained RAG systems. Further investigation is needed to understand the impact of document noise on RAG performance in various settings.
Ke Bahasa Lain
dari konten sumber
arxiv.org
Wawasan Utama Disaring Dari
by Alexandria L... pada arxiv.org 11-13-2024
https://arxiv.org/pdf/2411.07396.pdfPertanyaan yang Lebih Dalam