This work leverages unlabeled large language model generations collected in the wild to detect hallucinated content effectively through an automated membership estimation approach.
This work detects a specific subclass of hallucinations, termed confabulations, in large language models in order to address the problem of factually incorrect or irrelevant responses.
Large language models often generate false or unsubstantiated outputs, known as "hallucinations", which prevent their adoption in critical domains. This work proposes a general method to detect a subset of hallucinations, called "confabulations", by estimating the semantic entropy of model outputs.
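A minimal sketch of the semantic-entropy idea, under the assumption that several answers are sampled for the same prompt together with their sequence log-probabilities; `same_meaning` is a hypothetical callable (e.g., backed by a bidirectional-entailment check with an NLI model) and is not part of the original summary. Answers are clustered by meaning, probability mass is aggregated per cluster, and high entropy over clusters flags a likely confabulation.

```python
import math

def semantic_entropy(answers, logprobs, same_meaning):
    """Estimate semantic entropy over sampled answers for one prompt.

    answers:      list of sampled completions
    logprobs:     per-answer sequence log-probabilities from the model
    same_meaning: hypothetical callable(a, b) -> bool, e.g. bidirectional
                  entailment judged by an NLI model
    """
    # Greedily cluster answers into meaning-equivalence classes.
    clusters = []  # each cluster is a list of answer indices
    for i, ans in enumerate(answers):
        for cluster in clusters:
            if same_meaning(answers[cluster[0]], ans):
                cluster.append(i)
                break
        else:
            clusters.append([i])

    # Convert log-probabilities to normalized probabilities
    # (subtract the max for numerical stability).
    m = max(logprobs)
    probs = [math.exp(lp - m) for lp in logprobs]
    total = sum(probs)

    # Aggregate probability mass per semantic cluster.
    cluster_mass = [sum(probs[i] for i in c) / total for c in clusters]

    # Shannon entropy over clusters: high entropy -> likely confabulation.
    return -sum(p * math.log(p) for p in cluster_mass if p > 0)
```

In use, one would sample a handful of answers at non-zero temperature, compute this score, and flag the prompt as likely confabulated when the entropy exceeds a tuned threshold.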
The Hallucinations Leaderboard is an open initiative to quantitatively measure and compare the tendency of large language models to produce hallucinations, that is, outputs that do not align with factual reality or the input context.
Hallucinations in large language models can be detected effectively by using tractable probabilistic models to analyze the model's internal state-transition dynamics during generation.
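The summary leaves the exact model class open; purely as an illustration, the sketch below fits a Gaussian HMM (a simple tractable probabilistic model standing in for whatever the paper uses) over per-token hidden-state trajectories and scores new generations by negative log-likelihood, so trajectories that the dynamics model explains poorly are flagged as suspicious.

```python
import numpy as np
from hmmlearn.hmm import GaussianHMM

def fit_state_dynamics_model(hidden_state_seqs, n_components=8):
    """Fit a Gaussian HMM over per-token hidden-state trajectories.

    hidden_state_seqs: list of arrays of shape (seq_len, d), e.g.
    dimension-reduced transformer hidden states for known-good generations.
    """
    X = np.concatenate(hidden_state_seqs)          # (total_tokens, d)
    lengths = [len(s) for s in hidden_state_seqs]  # per-sequence lengths
    hmm = GaussianHMM(n_components=n_components, covariance_type="diag",
                      n_iter=50, random_state=0)
    hmm.fit(X, lengths)
    return hmm

def hallucination_score(hmm, hidden_state_seq):
    """Higher score = lower per-token likelihood under the dynamics model."""
    return -hmm.score(hidden_state_seq) / len(hidden_state_seq)
```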
The SHROOM-INDElab system uses prompt engineering and in-context learning with large language models (LLMs) to build classifiers for hallucination detection, achieving competitive performance in the SemEval-2024 Task 6 competition.
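A hedged sketch of the prompt-based classification idea (not the authors' exact prompts or pipeline): a few-shot prompt asks a judging LLM whether a generated answer is supported by its source, and the verdict is parsed into a binary label. `call_llm` is a hypothetical stand-in for whatever chat-completion API is used.

```python
FEW_SHOT_EXAMPLES = """\
Source: The Eiffel Tower is in Paris.
Answer: The Eiffel Tower is located in Berlin.
Verdict: Hallucination

Source: Water boils at 100 degrees Celsius at sea level.
Answer: At sea level, water boils at 100 C.
Verdict: Not Hallucination
"""

def detect_hallucination(source: str, answer: str, call_llm) -> bool:
    """Return True if the judging LLM deems `answer` unsupported by `source`.

    call_llm: hypothetical callable(prompt: str) -> str wrapping the
              chat/completions endpoint of the judging model.
    """
    prompt = (
        "Decide whether the Answer is supported by the Source. "
        "Reply with exactly 'Hallucination' or 'Not Hallucination'.\n\n"
        f"{FEW_SHOT_EXAMPLES}\n"
        f"Source: {source}\nAnswer: {answer}\nVerdict:"
    )
    verdict = call_llm(prompt).strip().lower()
    return verdict.startswith("hallucination")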
KnowHalu proposes a two-phase process for detecting hallucinations in text generated by large language models (LLMs). The first phase identifies non-fabrication hallucinations, i.e., responses that are not false but fail to address the query, while the second phase performs multi-form knowledge-based factual checking to detect fabrication hallucinations. A high-level sketch of such a pipeline follows.
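This sketch is written under stated assumptions rather than from the paper's code: `is_relevant_and_specific`, `decompose_query`, `retrieve_knowledge`, and `supported_by` are hypothetical helpers standing in for the relevance check, query decomposition, multi-form knowledge retrieval, and judgment steps.

```python
from dataclasses import dataclass

@dataclass
class Verdict:
    hallucinated: bool
    reason: str

def two_phase_check(question, answer, is_relevant_and_specific,
                    decompose_query, retrieve_knowledge, supported_by):
    """Two-phase hallucination check (sketch with hypothetical helpers).

    Phase 1: flag non-fabrication hallucinations (answers that do not
             actually address the question).
    Phase 2: decompose the question into sub-queries, retrieve knowledge
             in multiple forms (e.g., triples and free text), and check
             the answer against the retrieved evidence.
    """
    # Phase 1: non-fabrication hallucination check.
    if not is_relevant_and_specific(question, answer):
        return Verdict(True, "answer is irrelevant or non-specific")

    # Phase 2: multi-form knowledge-based factual checking.
    for sub_query in decompose_query(question, answer):
        evidence = retrieve_knowledge(sub_query)  # e.g., triples + passages
        if not supported_by(answer, evidence):
            return Verdict(True, f"claim unsupported for: {sub_query}")

    return Verdict(False, "all checked claims supported")
```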