Core Concepts
Automated fact-checking systems often rely on external evidence from the web, but this evidence can be unreliable or leaked from existing fact-checking articles, undermining the effectiveness of such systems. This work proposes a comprehensive approach to evidence verification and filtering to address these challenges.
Abstract
The content discusses the problem of unreliable and leaked evidence in automated fact-checking (AFC) systems. It highlights that while AFC systems leverage external information from the web to help examine the veracity of claims, they often overlook the importance of verifying the source and quality of the collected "evidence".
The key points are:
Reliance on "leaked evidence" (information gathered directly from fact-checking websites) and inclusion of information from unreliable sources can undermine the effectiveness of AFC systems.
To address these challenges, the authors propose a comprehensive approach to evidence verification and filtering. They create the "CREDible, Unreliable or LEaked" (CREDULE) dataset, which consists of 91,632 articles classified as Credible, Unreliable and Fact-checked (Leaked).
The authors also introduce the EVidence VERification Network (EVVER-Net), trained on CREDULE to detect leaked and unreliable evidence in both short and long texts. EVVER-Net can be used to filter evidence collected from the web, thus enhancing the robustness of end-to-end AFC systems.
Experiments show that EVVER-Net can demonstrate impressive performance of up to 91.5% and 94.4% accuracy, while leveraging domain credibility scores along with short or long texts, respectively.
The authors assess the evidence provided by widely-used fact-checking datasets, including LIAR-PLUS, MOCHEG, FACTIFY, NewsCLIPpings+ and VERITE, and identify concerning rates of leaked and unreliable evidence.
Stats
"Obama, Out of Office 10 Days, Speaks Out Against Immigration Ban"
"Obama Rejects Trump Immigration Orders, Backs Protests"
"Did President Obama Ban Muslims from Entering the United States in 2011?"
"MORE HYPOCRISY: Obama Banned all Iraqi Refugees for 6 Months in 2011 – Liberals SAID NOTHING"
Quotes
"Automated fact-checking (AFC) is garnering increasing attention by researchers aiming to help fact-checkers combat the increasing spread of misinformation online."
"One overlooked challenge involves the reliance on "leaked evidence", information gathered directly from fact-checking websites and used to train AFC systems, resulting in an unrealistic setting for early mis-information detection."
"Similarly, the inclusion of information from unreliable sources can undermine the effectiveness of AFC systems."