Bansal, S., Singh, N. S., Dar, S. S., & Kumar, N. (2024). MMCFND: Multimodal Multilingual Caption-aware Fake News Detection for Low-resource Indic Languages. arXiv preprint arXiv:2410.10407.
This paper addresses the lack of robust methods for detecting multimodal fake news in low-resource Indic languages by introducing a new dataset and a novel framework that leverages multimodal and multilingual approaches.
The authors curated a large-scale dataset called MMIFND, containing 28,085 real and fake news samples in seven Indic languages. They proposed a framework called MMCFND, which utilizes pre-trained unimodal encoders (MuRIL for Indic text, FLAVA for English text, NASNet for images) and a pairwise encoder from FLAVA for aligning vision and language. Additionally, they employed BLIP-2 to generate descriptive image captions, enriching the visual representation. These features were then fused and fed into a classifier to determine the authenticity of news articles.
The authors found that features retrieved from the foundational model (FLAVA) enhanced unimodal features with crucial additional information. Their proposed MMCFND framework outperformed existing fake news detection methods on the MMIFND dataset, demonstrating the effectiveness of their approach.
The study highlights the importance of multimodal and multilingual approaches for fake news detection in low-resource languages. The curated MMIFND dataset and the proposed MMCFND framework provide valuable resources for future research in this domain.
This research significantly contributes to combating misinformation in the Indian subcontinent by providing a comprehensive dataset and a robust framework for detecting fake news in low-resource Indic languages.
The study primarily focuses on seven Indic languages. Future research could expand the dataset to include more languages and explore the generalization of the proposed framework to other low-resource language contexts. Additionally, investigating the impact of different image captioning models and exploring alternative multimodal fusion techniques could further enhance the framework's performance.
toiselle kielelle
lähdeaineistosta
arxiv.org
Tärkeimmät oivallukset
by Shubhi Bansa... klo arxiv.org 10-15-2024
https://arxiv.org/pdf/2410.10407.pdfSyvällisempiä Kysymyksiä