NLG systems often produce fluent but inaccurate outputs, i.e., hallucinations that undermine the correctness of NLG applications. The author presents the results of the SHROOM shared task, which focused on detecting such hallucinations in the outputs of natural language generation systems.