核心概念
Leveraging the NLP capabilities of open-source large language models to enhance the quality of conversational history can significantly improve the performance of query rewriting in open-domain conversational search.
摘要
The paper introduces CHIQ, a two-step method that leverages open-source large language models (LLMs) to enhance the quality of conversational history before performing query rewriting. The key idea is to utilize the basic NLP capabilities of LLMs, such as resolving ambiguities, expanding context, and summarizing history, to make the conversational history less ambiguous and more informative for the subsequent query rewriting step.
The authors propose five different approaches to enhance the conversational history:
- Question Disambiguation: Resolving ambiguities and coreferences in the user's question.
- Response Expansion: Enriching the content of the system's previous response.
- Pseudo Response: Generating a self-contained pseudo-response based on the conversation history.
- Topic Switch: Detecting topic switches and truncating the history accordingly.
- History Summary: Summarizing the conversation history to capture the most relevant information.
The authors then explore three ways to leverage the enhanced conversational history for query rewriting:
- Ad-hoc Query Rewriting (CHIQ-AD): Directly using the enhanced history as input to an off-the-shelf retriever.
- Search-Oriented Fine-tuning (CHIQ-FT): Fine-tuning a small language model for query rewriting using the enhanced history and pseudo-supervision signals.
- CHIQ-Fusion: Fusing the results from CHIQ-AD and CHIQ-FT.
The experiments conducted on five well-established conversational search benchmarks demonstrate that CHIQ, using open-source LLMs, achieves state-of-the-art performance across most settings, often surpassing systems that rely on closed-source LLMs. The analysis reveals that enhancing the conversational history is crucial for open-source LLMs to be competitive with closed-source alternatives in the context of conversational search.
統計資料
"George Harrison wrote the song "Something" for his wife Pattie Boyd."
"The song "Something" is a part of the album "Abbey Road" by The Beatles, and was later covered by Joe Cocker."
引述
"Leveraging the NLP capabilities of open-source large language models to enhance the quality of conversational history can significantly improve the performance of query rewriting in open-domain conversational search."
"We demonstrate on five well-established benchmarks that CHIQ leads to state-of-the-art results across most settings, showing highly competitive performances with systems leveraging closed-source LLMs."