Core Concepts
This study investigates the design strategies and opportunities for interactive visualization tools in computational notebooks through a systematic review of 163 notebook visualization tools.
Abstract
The authors conducted a systematic review of 163 notebook visualization tools, including 64 systems from academic papers and 105 tools sourced from a pool of 55k notebooks containing interactive visualizations scraped from 8.6 million notebooks on GitHub.
Key highlights:
Motivations for developing notebook visualization tools: seamless workflow integration, easy access to read and refine artifacts, portability and shareability, and ease of implementation.
Design patterns characterized by a four-dimensional framework: notebook-visualization integration, data source and type, display style and sensemaking context, and modularity.
Design implications and trade-offs, such as the need to engage with targeted user groups, balancing notebook integration and platform compatibility, and considering display styles and modularity based on user needs.
Empirical evidence that tools supporting more notebook platforms have significantly greater impact in terms of GitHub stars and paper citations.
Development of SuperNOVA, an interactive browser to help researchers explore existing notebook visualization tools.
The authors discuss future research opportunities, such as democratizing notebook visualization tool creation, enriching fluid notebook-visualization integration, and promoting responsible AI through notebook workflows.
Stats
"Notebooks are the most popular programming environment among data scientists [84]."
"Many researchers have developed notebook visualization tools to promote adoption among data scientists."
"Scientists use notebooks as an interface for accessing remote clusters [163], and publishing notebooks with academic papers is considered good practice for reproducible research [76]."
"Educators also use notebooks for assigning and grading programming assignments [79]."
"Tools that support more notebook platforms have significantly more GitHub stars and paper citations."
Quotes
"To shed light on the existing landscape of notebook visualization tools and help visualization researchers and practitioners harness the potential of notebook environments, we contribute: The first systematic review of 163 notebook visualization tools including 64 systems introduced in academic papers and 105 tools sourced from a pool of 55k notebooks containing interactive visualizations that we obtain via scraping 8.6 million notebooks on GitHub."
"Notebooks are the most popular programming environment among data scientists [84]. Consequently, many researchers have developed notebook visualization tools to promote adoption among data scientists."
"Scientists use notebooks as an interface for accessing remote clusters [163], and publishing notebooks with academic papers is considered good practice for reproducible research [76]."
"Educators also use notebooks for assigning and grading programming assignments [79]."