Core Concepts
Detecting and classifying temporally ambiguous questions is crucial for improving open-domain question answering systems, as the answers to such questions can vary depending on the specific time frame.
Abstract
The paper introduces TEMPAMBIQA, a manually annotated dataset of 8,162 open-domain questions, with 3,879 labeled as temporally ambiguous and 4,283 as unambiguous. The authors propose and test various search strategies to efficiently detect temporal ambiguity in questions, including Linear Search, Skip-List Search, Random Search, and Divide-and-Conquer approaches.
The key highlights and insights from the paper are:
- Temporal ambiguity in questions is a significant challenge in open-domain question answering, as the answers can vary depending on the time frame.
- The TEMPAMBIQA dataset provides a valuable resource for studying temporal ambiguity detection, with a mix of ambiguous and unambiguous questions from various existing datasets.
- The proposed search strategies, such as Skip-List (2) and Random (5) Search, demonstrate promising performance in detecting temporal ambiguity, with the Qwen-110B model achieving the best overall results.
- Zero-Shot and Few-Shot approaches using large language models like GPT-4 and LLaMA-70B also show potential for handling novel temporal contexts, though fine-tuned BERT models exhibit a trade-off between precision and recall.
- The efficiency of the search strategies is also analyzed, with the Skip-List (2) approach offering a good balance between performance and computational cost.
The authors also discuss the limitations of their work, such as the reliance on the specified time frame and the dependence on the knowledge embedded in the large language models used.
Stats
Over 50% of questions in Google search queries are ambiguous. (Min et al., 2020)
The TEMPAMBIQA dataset contains 8,162 questions, with 3,879 labeled as temporally ambiguous and 4,283 as unambiguous.
Quotes
"Temporal ambiguity, in particular, occurs when a question involves unclear or unspecified time frames, leading to different answers depending on the assumed temporal context."
"Temporal ambiguity poses unique challenges for QA systems, as these need to understand the temporal context of a question to provide the correct answer."