Five Novel Datasets for Challenging Key Information Extraction Tasks in Enterprise Settings
This paper introduces RealKIE, a benchmark of five document datasets that present realistic challenges for key information extraction tasks, including poor document quality, sparse annotations in long documents, and complex tabular layouts.