The proposed system aims to address the prevalence of errors in chest X-ray (CXR) diagnoses, particularly among inexperienced radiologists and hospital residents, by understanding radiologists' intentions and the corresponding regions of interest. The system comprises two main modules: Temporally Grounded Intention Detection (TGID) and Region Extraction (RE).
The TGID module utilizes the fixation heatmap video and the time steps embedded in the radiology report as inputs to predict the main intentions in the radiology report with the corresponding temporal grounding. The RE module then extracts clips from the input video based on the predicted time steps and the identified intention to determine a representative image for the region of interest associated with the intended purpose.
The key contributions of this work include the development of a novel system for comprehending radiologists' intentions and the corresponding regions of interest, the introduction of evaluation strategies, and the pioneering of a new task known as radiologist intention detection within the medical domain. The system has the potential to rectify mistakes made by inexperienced radiologists, guide them to the correct regions of interest, and serve as a valuable tool for enhancing diagnostic accuracy and fostering continuous learning within the medical community.
Vers une autre langue
à partir du contenu source
arxiv.org
Questions plus approfondies