Large language models (LLMs) can distinguish between the relevance and utility of passages in supporting open-domain question answering, and their utility judgments can provide more valuable guidance than relevance judgments in identifying the ground-truth evidence needed to answer a question. However, LLM performance on utility judgments is sensitive to several instruction-design factors, such as the input form of the passages, the order in which the question and passages are presented, and additional requirements such as chain-of-thought reasoning.
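The sketch below illustrates, under assumed prompt wording, how the instruction-design factors named above (listwise vs. pointwise passage input, question/passage order, and an optional chain-of-thought requirement) could be varied when eliciting utility judgments from an LLM. The function name and templates are illustrative placeholders, not the paper's exact prompts.

```python
from typing import List


def build_utility_prompt(
    question: str,
    passages: List[str],
    listwise: bool = True,           # input form: all passages at once vs. one at a time
    question_first: bool = True,     # order of question relative to passages
    chain_of_thought: bool = False,  # ask for reasoning before the judgment
) -> str:
    """Assemble a utility-judgment prompt under one configuration of the factors."""
    passage_block = (
        "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
        if listwise
        else passages[0]  # pointwise: caller passes a single passage
    )
    instruction = (
        "Judge which passages are actually useful for answering the question, "
        "not merely topically relevant."
    )
    if chain_of_thought:
        instruction += " Think step by step before giving your final judgment."

    parts = [instruction]
    if question_first:
        parts += [f"Question: {question}", f"Passages:\n{passage_block}"]
    else:
        parts += [f"Passages:\n{passage_block}", f"Question: {question}"]
    parts.append("Useful passage numbers:" if listwise else "Useful (yes/no):")
    return "\n\n".join(parts)


if __name__ == "__main__":
    print(build_utility_prompt(
        "Who wrote The Selfish Gene?",
        ["Richard Dawkins published The Selfish Gene in 1976.",
         "Gene expression is regulated by transcription factors."],
        chain_of_thought=True,
    ))
```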
MFORT-QA leverages few-shot learning, chain-of-thought prompting, and retrieval-augmented generation to accurately answer complex questions by extracting relevant information from tables and associated hyperlinked contexts.
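As a rough illustration of such a pipeline, the sketch below retrieves table rows and hyperlinked passages relevant to a question and assembles a few-shot chain-of-thought prompt over the retrieved evidence. The term-overlap retrieval, exemplar, and prompt wording are hypothetical stand-ins, not the MFORT-QA authors' implementation.

```python
from typing import Dict, List


def retrieve_evidence(question: str,
                      table_rows: List[str],
                      linked_passages: Dict[str, str],
                      top_k: int = 3) -> List[str]:
    """Toy retrieval: rank table rows and hyperlinked passages by term overlap with the question."""
    q_terms = set(question.lower().split())
    candidates = table_rows + [f"{anchor}: {text}" for anchor, text in linked_passages.items()]
    scored = sorted(candidates,
                    key=lambda c: len(q_terms & set(c.lower().split())),
                    reverse=True)
    return scored[:top_k]


def build_fewshot_cot_prompt(question: str, evidence: List[str],
                             exemplars: List[str]) -> str:
    """Prepend few-shot chain-of-thought exemplars, then the retrieved evidence and the question."""
    evidence_block = "\n".join(f"- {e}" for e in evidence)
    return (
        "\n\n".join(exemplars)
        + f"\n\nEvidence:\n{evidence_block}"
        + f"\nQuestion: {question}\nLet's reason step by step:"
    )


if __name__ == "__main__":
    rows = ["Film | Parasite | Director | Bong Joon-ho | Year | 2019"]
    links = {"Bong Joon-ho": "Bong Joon-ho is a South Korean film director born in 1969."}
    exemplar = ("Evidence:\n- Country | France | Capital | Paris\n"
                "Question: What is the capital of France?\n"
                "Let's reason step by step: The row lists Paris as the capital. Answer: Paris")
    question = "When was the director of Parasite born?"
    evidence = retrieve_evidence(question, rows, links)
    print(build_fewshot_cot_prompt(question, evidence, [exemplar]))
```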