This research paper introduces VERITAS-NLI, a novel system for fake news detection that surpasses the limitations of traditional methods by combining web scraping and Natural Language Inference (NLI).
Research Objective:
The study addresses the growing concern of fake news by developing a system that can effectively identify unreliable headlines in a rapidly evolving news environment.
Methodology:
VERITAS-NLI employs web scraping techniques to retrieve external knowledge from reputable sources based on the input headline. This information is then processed by NLI models (FactCC and SummaC) to detect inconsistencies between the headline and the retrieved content. The system utilizes three distinct pipelines: Question-Answer Pipeline, Small Language Model Pipeline, and Article Pipeline, each employing different scraping and NLI approaches.
Key Findings:
The study demonstrates that VERITAS-NLI significantly outperforms classical machine learning models and BERT in fake news detection. The Article Pipeline, using SummaC-ZS as the NLI model, achieved the highest accuracy of 84.3%, a substantial improvement over baseline models. The research also highlights the effectiveness of SummaC over FactCC for headline inconsistency detection due to its sentence-level granularity.
Main Conclusions:
VERITAS-NLI offers a robust and adaptable solution for combating fake news by dynamically verifying claims against real-time information. The system's reliance on web scraping and NLI allows it to remain relevant and effective in a constantly changing news landscape.
Significance:
This research contributes to the field of fake news detection by proposing a novel approach that addresses the limitations of static training data and enhances accuracy. The findings have practical implications for developing more reliable and trustworthy news verification systems.
Limitations and Future Research:
While VERITAS-NLI shows promising results, further research can explore the use of larger language models for question generation and investigate the impact of source credibility on the system's performance.
Til et annet språk
fra kildeinnhold
arxiv.org
Viktige innsikter hentet fra
by Arjun Shah, ... klokken arxiv.org 10-15-2024
https://arxiv.org/pdf/2410.09455.pdfDypere Spørsmål