NAIST-SIC-Aligned: A Large-Scale Parallel English-Japanese Simultaneous Interpretation Corpus for Improving Simultaneous Machine Translation
This work introduces NAIST-SIC-Aligned, a large-scale parallel English-Japanese simultaneous interpretation (SI) corpus, to address the lack of SI data for training and evaluating simultaneous machine translation (SiMT) systems.