แนวคิดหลัก
Introducing Multiple Embedding Model for EHR (MEME), a novel approach that converts tabular EHR data into textual "pseudo-notes" to effectively leverage Large Language Models for improved performance on various Emergency Department prediction tasks.
บทคัดย่อ
This paper introduces Multiple Embedding Model for EHR (MEME), a novel approach that converts tabular EHR data into textual "pseudo-notes" to effectively leverage Large Language Models (LLMs) for improved performance on various Emergency Department (ED) prediction tasks.
Key highlights:
- MEME generates "pseudo-notes" by inserting tabular EHR data into template sentences, transforming the data into a textual format that can be processed by LLMs.
- MEME adopts a multimodal approach, encoding each EHR modality (e.g., arrival information, triage, medications) separately, and then using a self-attention layer to analyze the combined representation.
- MEME is evaluated on two sets of ED prediction tasks: 1) Predicting ED disposition (discharge vs. admission), and 2) Predicting ED decompensation measures (discharge location, ICU admission, mortality) for admitted patients.
- MEME outperforms both single modality embedding methods and traditional machine learning approaches like Random Forest on these tasks.
- However, the authors observe notable limitations in the generalizability of all tested models across different hospital institutions, highlighting the need for more representative public datasets.
สถิติ
The patient was previously taking the following medications: albuterol sulfate, peg 3350-electrolytes, nicotine, spironolactone [aldactone], emtricitabine-tenofovir [truvada], raltegravir [isentress], furosemide, ipratropium bromide [atrovent hfa], ergocalciferol (vitamin d2).
The patient received the following diagnostic codes: ICD-9 code: [78959], ICD-9 code: [07070], ICD-9 code: [5715], ICD-9 code: [v08].
The patient received the following medications: morphine, donnatol (elixir), aluminum-magnesium hydrox.-simet, ondansetron.
คำพูด
"MEME employs "pseudo-notes" to convert EHR raw tabular data into clinically meaningful text."
"We demonstrate that multimodal representation outperforms traditional machine learning and LLM-based models which represent EHR as a single heterogeneous data modality across multiple tasks."
"We find that the representation derived from the MIMIC-IV Database is insufficient for generalizing across different hospital systems."