核心概念
KnowHalu proposes a two-phase process for detecting hallucinations in text generated by large language models (LLMs). The first phase identifies non-fabrication hallucinations, while the second phase performs multi-form knowledge-based factual checking to detect fabrication hallucinations.
要約
The paper introduces KnowHalu, a novel approach for detecting hallucinations in text generated by large language models (LLMs). The key highlights are:
-
Non-Fabrication Hallucination Checking:
- This phase identifies non-fabrication hallucinations, where the answer is factually correct but irrelevant or non-specific to the query.
- It uses an extraction-based specificity check to reduce false positives and effectively identify non-fabrication hallucinations.
-
Factual Checking:
- This phase consists of five steps: (a) Step-wise Reasoning and Query, (b) Knowledge Retrieval, (c) Knowledge Optimization, (d) Judgment Based on Multi-form Knowledge, and (e) Aggregation.
- It decomposes the original query into a sequence of simpler, one-hop sub-queries to enhance the accuracy of knowledge retrieval.
- It leverages both unstructured knowledge (e.g., normal semantic sentences) and structured knowledge (e.g., object-predicate-object triplets) for factual checking, capturing a comprehensive spectrum of factual information.
- The aggregation mechanism combines the judgments based on different forms of knowledge to further reduce the hallucinations in the final prediction.
The extensive experiments demonstrate that KnowHalu significantly outperforms state-of-the-art baselines in detecting hallucinations across diverse tasks, such as improving by 15.65% in the QA task and 5.50% in the summarization task.
統計
"Star Wars," released in 1977, is the space-themed movie in which the character Luke Skywalker first appeared.
John Williams composed the score for "Star Wars."
引用
"KnowHalu proposes a two-phase process for detecting hallucinations in text generated by large language models (LLMs), utilizing step-wise reasoning, multi-formulation query, multi-form knowledge for factual checking, and fusion-based detection mechanism."
"Our extensive evaluations demonstrate that KnowHalu significantly outperforms SOTA baselines in detecting hallucinations across diverse tasks, e.g., improving by 15.65% in QA tasks and 5.50% in summarization tasks, highlighting its effectiveness and versatility in detecting hallucinations in LLM-generated content."