Classifying Conspiracy Theories in Online Discussions: Identifying Established and Emerging Narratives at Scale
מושגי ליבה
Online discussions frequently involve conspiracy theories, which can contribute to the proliferation of belief in them. This work establishes a general scheme for classifying discussions related to conspiracy theories based on authors' perspectives, which can be expressed explicitly through narrative elements or implicitly through references to known theories.
תקציר
The paper presents a comprehensive study on identifying conspiracy theory (CT) narratives in online discussions, particularly in Reddit communities dedicated to CTs. The key highlights are:
-
Establishment of a general, topic-independent classification scheme that considers the multifaceted nature of CT narratives, including the author's perspective (promoting or debunking) expressed explicitly through narrative elements or implicitly through references to known theories.
-
Development of BERT-based models that outperform traditional machine learning approaches and the state-of-the-art generative model GPT in classifying CT content. The best-performing RoBERTa model achieved an AUC of 0.787.
-
Analysis of the strengths and limitations of GPT in CT detection, revealing its ability to recognize well-known CTs but struggles with more nuanced cues, such as sarcasm and implicit connections.
-
Large-scale classification of posts from the most active CT-related Reddit forums, finding that only one-third of the posts are classified as CT narratives, suggesting that not all content from these forums necessarily promotes CTs.
The study provides valuable insights into the prevalence and characteristics of CT narratives in online discussions, highlighting the need for nuanced approaches to accurately identify and understand the spread of conspiracy theories.
Classifying Conspiratorial Narratives At Scale
סטטיסטיקה
"The corruption in science and the censorship in the media is one and the same: peer review."
"Dr. Strangelove was right, except with chlorine instead of fluoride. Semen is not supposed to smell like bleach, chlorine and chlorides are getting absorbed when you bathe in/drink/inhale it."
"Turkey rocked by 7M Quake hours after France attack; World leaders Outraged at Macron!"
"How does marxism/communism take over so many countries? It happened to Europe, the UK, Russia and is now happening to the US. How are they able to do this to every country while so many people don't even realize it?"
ציטוטים
"The corruption in science and the censorship in the media is one and the same: peer review."
"Dr. Strangelove was right, except with chlorine instead of fluoride. Semen is not supposed to smell like bleach, chlorine and chlorides are getting absorbed when you bathe in/drink/inhale it."
"Turkey rocked by 7M Quake hours after France attack; World leaders Outraged at Macron!"
"How does marxism/communism take over so many countries? It happened to Europe, the UK, Russia and is now happening to the US. How are they able to do this to every country while so many people don't even realize it?"
שאלות מעמיקות
How can the classification model be further improved to better detect emerging and novel conspiracy theories that do not explicitly reference known theories?
To enhance the classification model's ability to detect emerging and novel conspiracy theories that do not explicitly reference known theories, several strategies can be implemented:
Continuous Training: Regularly updating the model with new data containing emerging conspiracy theories can help it stay current and adapt to evolving narratives.
Semantic Understanding: Incorporating advanced natural language processing techniques to improve the model's semantic understanding can help identify subtle cues and implicit references to conspiracy theories.
Contextual Analysis: Enhancing the model's contextual comprehension capabilities can enable it to recognize the underlying themes and connections in posts that hint at emerging conspiracy theories.
Pattern Recognition: Developing algorithms that can identify patterns in language usage and topic shifts in online discussions can aid in flagging posts that exhibit characteristics of emerging conspiracy theories.
Collaborative Filtering: Implementing a collaborative filtering approach where the model learns from human experts or fact-checkers to identify and classify new conspiracy theories can improve its accuracy in detecting novel narratives.
By incorporating these strategies, the classification model can become more adept at detecting emerging and novel conspiracy theories that may not explicitly reference known theories.
What are the potential societal and political implications of the finding that only one-third of posts in CT-focused forums are classified as promoting conspiracy theories?
The finding that only one-third of posts in CT-focused forums are classified as promoting conspiracy theories has several significant societal and political implications:
Misinformation Mitigation: Identifying that a large portion of posts in CT forums do not promote conspiracy theories can help in distinguishing between genuine discussions and misinformation, enabling targeted interventions to counter false narratives.
Public Perception: Understanding the prevalence of non-conspiratorial content in CT forums can influence public perception of these platforms, potentially reducing the stigma associated with them and fostering more nuanced discussions.
Algorithmic Impact: Platforms may need to reevaluate their algorithms to ensure that non-conspiratorial content is not inadvertently promoted or amplified, thereby mitigating the spread of harmful conspiracy theories.
Policy Development: Policymakers and regulators can use this information to tailor interventions and policies that address the dissemination of conspiracy theories online, focusing on the specific content that promotes harmful narratives.
Community Building: Recognizing the diversity of content in CT forums can pave the way for building communities that engage in constructive dialogue, debunking misinformation, and fostering critical thinking.
By acknowledging the prevalence of non-conspiratorial content in CT forums, stakeholders can develop targeted strategies to address the spread of harmful conspiracy theories and promote more informed discussions online.
How can the insights from this study be leveraged to develop effective interventions and counter-narratives to mitigate the spread of harmful conspiracy theories online?
The insights from this study can be leveraged to develop effective interventions and counter-narratives to mitigate the spread of harmful conspiracy theories online through the following approaches:
Education and Awareness Campaigns: Utilize the classification model to identify and target posts promoting conspiracy theories for fact-checking and provide educational resources to debunk misinformation.
Community Moderation: Empower community moderators with the tools and knowledge to identify and address conspiracy theories effectively, fostering a culture of critical thinking and healthy discourse.
Algorithmic Adjustments: Work with social media platforms to adjust algorithms to reduce the visibility of harmful conspiracy theories and promote authoritative sources of information.
Collaborative Efforts: Collaborate with researchers, fact-checkers, and tech companies to develop comprehensive strategies for combating the spread of conspiracy theories online.
Psychological Support: Provide psychological support and resources for individuals who may be susceptible to believing in conspiracy theories, offering alternative narratives and critical thinking skills.
By leveraging these insights and implementing targeted interventions, it is possible to create a more informed and resilient online community that is better equipped to counter the spread of harmful conspiracy theories.