Core Concepts
The core message of this article is to propose a computational model and visualization methods to identify the strength and weakness rules of individual cricket players using unstructured short text commentary data.
Abstract
The article presents a system to process and analyze unstructured cricket short text commentary data to extract insights about individual player's strengths and weaknesses.
Key highlights:
Proposes the use of unstructured cricket short text commentary data for visualization, which is an untapped resource compared to the commonly used structured data like box-score and tracking data.
Introduces a computational definition of strength rule and weakness rule of a player, which captures the relationship between the player's batting features and the opponent's bowling features.
Presents visualization methods to interpret the obtained strength and weakness rules, as well as to identify players with similar strengths and weaknesses.
Validates the proposed approach through expert analysis and statistical tests, demonstrating the accuracy of the extracted rules.
Provides additional visualizations to analyze a player's outcomes, shot areas, and footwork on different delivery types.
Discusses the limitations of traditional text visualization techniques like word clouds in capturing the nuanced relationships between batting and bowling features.
The article demonstrates how unstructured cricket commentary data can be leveraged to gain deeper insights about individual player's strategies and performance, which can augment the existing sports visualization techniques that primarily rely on structured data.
Stats
The short text commentary data is used to construct a confrontation matrix that captures the co-occurrences of batting features and bowling features.
Some key statistics from the confrontation matrix:
Steve Smith has scored 1331 runs off good length deliveries in his career.
Steve Smith has been beaten 106 times on deliveries that move away from him.
Steve Smith has attacked 269 times on short length deliveries.
Quotes
"Steve Smith attacks the deliveries that are bowled on the leg stump."
"Steve Smith gets beaten on the deliveries that are swinging."