This research paper introduces a comprehensive dataset of posts from X (formerly Twitter) related to the 2024 U.S. Presidential Election, collected between May 1, 2024, and July 31, 2024. The researchers developed a custom scraping engine called X-Scraper to gather a wide range of election-related content, including posts, metadata, and user information.
The primary objective of this research is to provide a publicly available dataset that captures the dynamics of political discourse on X during the 2024 U.S. Presidential Election. This dataset aims to facilitate research on the influence of social media on public opinion, the spread of misinformation, and the role of key figures in shaping online narratives.
X-Scraper collects publicly available data from X.com using targeted keywords related to the 2024 election, political figures, and emerging events. For each post it gathers the content, media, user metadata, and user interface interactions. The collection period was divided into smaller intervals to account for specific events and discourse patterns.
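The interval-based, keyword-driven collection described above can be illustrated with a minimal sketch. This is not X-Scraper's code: the 7-day interval length, the keyword list, and the helper names `split_window` and `matches_keywords` are all assumptions made for illustration, since the summary does not state them. Only the May 1 to July 31, 2024 window comes from the text.

```python
from datetime import date, timedelta

def split_window(start, end, days):
    """Yield (interval_start, interval_end) pairs covering [start, end], both inclusive."""
    cursor = start
    while cursor <= end:
        # Clamp the final interval so it never runs past the end of the window.
        interval_end = min(cursor + timedelta(days=days - 1), end)
        yield cursor, interval_end
        cursor = interval_end + timedelta(days=1)

def matches_keywords(text, keywords):
    """Return True if any keyword occurs in the text, ignoring case."""
    lowered = text.lower()
    return any(keyword.lower() in lowered for keyword in keywords)

# Collection window stated in the summary.
START = date(2024, 5, 1)
END = date(2024, 7, 31)

# Assumptions for illustration only: the real interval length and keywords are not given.
INTERVAL_DAYS = 7
KEYWORDS = ["election 2024", "presidential debate"]

intervals = list(split_window(START, END, INTERVAL_DAYS))

if __name__ == "__main__":
    for interval_start, interval_end in intervals:
        print(interval_start.isoformat(), "to", interval_end.isoformat())
```

Splitting the window this way lets each interval be collected and inspected on its own, which fits the stated goal of accounting for specific events. A plain substring match is the simplest possible filter and would also illustrate the keyword bias the authors mention: posts that discuss the election without using a listed term are never collected.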
This dataset gives researchers a resource for analyzing trends in public opinion, investigating the spread of misinformation, and examining the influence of key figures on X during the 2024 U.S. Presidential Election. The researchers plan to expand the dataset and incorporate data from other sources in future work.
This research is significant as it provides a large-scale, publicly available dataset that can be used to study the complex interplay between social media and political processes during a crucial election cycle. The insights derived from this dataset can inform strategies to safeguard election integrity, mitigate misinformation, and promote a more informed and balanced online political discourse.
The researchers acknowledge limitations related to the representativeness of data collected solely from X and potential biases introduced by keyword-based scraping. Future work will focus on continuous data collection, analysis of verified users and suspected bots, and incorporating data from other social media platforms to provide a more comprehensive understanding of online political discourse.
Key Insights Distilled From
by Ashwin Balas... at arxiv.org 11-04-2024
https://arxiv.org/pdf/2411.00376.pdf