Evaluating Hallucination Risks in Medical Visual Question Answering Models
The core message of this paper is to create a benchmark dataset for evaluating the hallucination phenomenon in state-of-the-art medical visual question answering (Med-VQA) models, and to provide a comprehensive analysis of their performance on this benchmark.