Core Concepts
A novel methodology adept at decoding multilingual topic dynamics and identifying communication trends during crises using ARIMA time series analysis.
Abstract
The content delves into a novel methodology for decoding multilingual topic dynamics and identifying communication trends during crises. It introduces a data translation framework enhanced by LDA/HDP models, focusing on Tunisian social networks during the Coronavirus Pandemic. The process involves aggregating a multilingual corpus, translating it using a No-English-to-English Machine Translation approach, applying advanced modeling techniques like LDA and HDP models, and utilizing ARIMA time series analysis to decode evolving topic trends. The study aims to provide insights vital for organizations and governments striving to understand public perspectives during crises.
Directory:
Introduction
Challenges in crisis communication due to diverse linguistic environments on social media.
Proposal of a data-driven methodology for multilingual topic modeling.
Related Works
Comparison of monolingual vs. multilingual topic modeling approaches.
Studies on topic modeling in crisis communication using social media data.
Machine Translation Techniques
Overview of Rule-Based, Statistical, Neural, and Hybrid machine translation methods.
Multilingual Text Processing in Social Media Analysis
Exploration of sentiment analysis across different languages using deep learning techniques.
Proposed Methodology
Five primary phases: Data Collection, Data Preprocessing, No English-English Machine Translation approach, Topic Modeling with LDA/HDP models, Trends Identification with ARIMA model.
Topic Modeling Proposed Approach
Leveraging LDA and HDP algorithms to extract latent topics from English translated data.
Pre-Identification Trends Processing
Converting trend identification into a supervised learning task to define constant trends.
Stats
"Our model outperforms as confirmed by metrics like Coherence Score."
"Applying our method effectively identified key topics mirroring public sentiment."
Quotes
"No-English-to-English Machine Translation approach showed high accuracy and F1 scores."
"Our model outperforms standard approaches as confirmed by metrics like Coherence Score."