Core Concepts
A structured, interpretable, and interactive framework for generating chest X-ray reports using anatomical region detection, anatomical and clinical prompts, and a pre-trained large language model.
Abstract
The proposed method introduces a structured report generation framework that leverages a pre-trained large language model (LLM) guided by anatomical regions and clinical contextual prompts. The key aspects of the approach are:
Anatomy-Guided Structured Report Foundation:
Anatomical regions in chest X-rays are detected and used to generate focused sentences that center on key visual elements.
This establishes a structured report foundation with anatomy-based sentences.
Anatomical information is also converted into textual prompts to guide the LLM's understanding of the anatomy.
Clinical Context Integration:
Clinical context prompts, such as the patient's medical history and reason for examination, are incorporated to provide relevant information.
This enables physician interactivity, allowing them to actively participate in the report generation process by providing context.
LLM-Driven Report Generation:
The anatomy-focused sentences, anatomical prompts, and clinical context prompts are integrated into prompts for a pre-trained LLM.
The LLM then generates the final structured report by coordinating and consolidating these data sources.
The proposed framework addresses key issues in medical report generation, such as lack of structure, interpretability, and interactivity. By leveraging anatomical regions, prompts, and a powerful LLM, the method produces structured, interpretable, and clinically relevant chest X-ray reports.
The authors evaluate the approach on the MIMIC-CXR dataset using both natural language generation and clinical effectiveness metrics. The results demonstrate strong performance, outperforming state-of-the-art methods in terms of language quality, fluency, and clinical correctness.
Stats
There is no pneumothorax or pleural effusion.
Moderate pulmonary edema with bilateral pleural effusions.
Bibasilar atelectasis is unchanged.
Mild pulmonary edema.
The aorta is calcified.
Moderate cardiomegaly.
Quotes
"Our method introduces a prompt-guided approach to generate structured chest X-ray reports using a pre-trained large language model (LLM)."
"By integrating anatomy-focused sentences and anatomy/clinical prompts, the pre-trained LLM can generate structured chest X-ray reports tailored to prompted anatomical regions and clinical contexts."