Core Concepts
SoftTiger introduces a clinical large language model (CLaM) to structure clinical notes into data, addressing healthcare workflow challenges and improving patient care efficiency.
Abstract
SoftTiger is a groundbreaking model designed to structure clinical notes into valuable data for healthcare workflows. It addresses the challenges faced by healthcare professionals in managing unstructured clinical narratives. By fine-tuning a state-of-the-art LLM, SoftTiger outperforms other models and aims to enhance digitalization in healthcare.
The healthcare sector faces critical challenges due to high demand and physician burnout, emphasizing the need for innovative solutions like SoftTiger. The model's release of 13 billion and 70 billion parameter scales, along with datasets and code, contributes significantly to the industry. SoftTiger's focus on structuring patient information through tasks like international patient summary, clinical impression, and medical encounter showcases its potential impact on healthcare workflows.
The training methods employed by SoftTiger involve supervised fine-tuning on general-purpose foundation LLMs like Llama-2 and TigerBot. The model's strategic approach prioritizes domain adaptation while ensuring lightweight development for rapid experimentation. SoftTiger's performance evaluation demonstrates its superiority over other models in processing clinical notes efficiently.
Overall, SoftTiger represents a significant advancement in leveraging LLMs for structured clinical data management. Its contributions include releasing family models at different scales, developing algorithmic implementations specific to the healthcare domain, and open-sourcing training data for enabling comprehensive workflow tasks.
Stats
Our blind pairwise evaluation shows that SoftTiger outperforms other popular open-source models and GPT-3.5.
SoftTiger models are released at scales of 13 billion and 70 billion parameters.
An annotated sample of 620 clinical notes shows that 75% exceed 2k tokens.
Nearly half of physicians' time is devoted to digital paperwork rather than direct patient care.
The predicted shortfall of health workforce by 2030 is estimated at 18 million health workers.
Quotes
"LLMs may become a step-stone towards healthcare digitalization and democratization." - Authors