Comprehensive Dataset and Model for Vietnamese Optical Character Recognition-based Visual Question Answering
The authors introduce ViOCRVQA, a large-scale dataset for Vietnamese OCR-VQA, and propose VisionReader, an approach that outperforms state-of-the-art methods on this dataset.