Core Concepts
DyPyBench provides a large-scale, diverse, and ready-to-analyze benchmark of executable Python projects to facilitate dynamic program analysis.
Abstract
DyPyBench introduces a benchmark suite encompassing 50 Python projects with 681k lines of code and 30k test cases. It enables various applications in testing and dynamic analysis, such as comparing static and dynamic call graphs, training neural models like LExecutor, and mining API usage specifications from execution traces. The benchmark aims to address the lack of comprehensive executable Python project benchmarks for dynamic analyses.
Key points:
DyPyBench is the first large-scale benchmark suite for executable Python projects.
It includes 50 diverse projects with extensive test suites.
Applications include comparing call graphs, training neural models, and mining API specifications.
Provides a basis for future research in dynamic analysis of Python code.
Stats
The benchmark encompasses 50 popular open-source projects from various application domains.
Totaling 681k lines of Python code and 30k test cases.