Core Concepts
Understanding the temporal regularities of information consumption on Wikipedia reveals distinct patterns based on topics, access methods, and user countries.
Abstract
The study delves into the temporal regularities of Wikipedia consumption, revealing daily rhythms and patterns of individual articles. It explores the impact of topics, access methods, and user countries on consumption habits, providing insights for information systems design and understanding global information needs.
Abstract: Investigates temporal regularities in Wikipedia consumption patterns.
Introduction: Explores human life's cyclical nature and its reflection in digital behavior.
Data Extraction: Analyzes English Wikipedia's access logs and article properties.
Principal Components: Identifies prototypical shapes of consumption patterns.
Clustering: Examines clustering of articles based on access rhythms.
Topics and Access Methods: Investigates the relationship between topics, access methods, and time.
Country Analysis: Explores the influence of user countries on consumption patterns.
Ablation Study: Assesses the strength of factors in predicting temporal access rhythms.
Discussion: Discusses implications for information needs, cultural diversity, metrics, customization, and infrastructure optimization.
Stats
"We retain 3.45B pageloads associated with 6.3M articles."
"The first four principal components capture 73.6% of the total variance."
"Permuting 'time by country' reduces the R2 to -0.02984."
"Permuting all three factors reduces the R2 to -0.429."
Quotes
"Wikipedia as a platform fulfills multiple information needs."
"Our study offers insights into what content people consume online during the day."
"Understanding the content that draws more attention at a different time of the day has implications for design beyond Wikipedia."