Generating Synthetic Datasets for Evaluating Program Similarity Approaches
This paper presents a framework for generating large, synthetic datasets with known ground truth program similarity to aid in the evaluation of novel program similarity approaches.