The paper presents an integrated framework called CureFun that utilizes the capabilities of LLMs to simulate patient roles in clinical education. The key highlights are:
Data Processing: The framework constructs a structured case graph by extracting entities, relationships, and attributes from the original patient scripts using information extraction techniques. This enables efficient retrieval and generation of relevant information during the student-patient dialogue.
Graph-Driven Context-Adaptive SP Chatbot: CureFun integrates a graph-driven mechanism to dynamically adjust the dialogue flow and generate coherent responses. It can synthesize rational attributes based on the known information in the case graph to maintain consistency, even when the user asks about missing details.
LLM-based Automatic Assessment: The framework transforms the complex evaluation checklists into an automated scoring program compatible with LLMs. It employs an ensemble of LLMs to provide comprehensive and reliable assessments of students' medical dialogues, enabling large-scale and efficient SP-involved evaluations.
Evaluation: Comprehensive experiments demonstrate that CureFun enables more authentic and professional dialogue flows in SP scenarios compared to other LLM-based chatbots. The automatic assessment method also shows a high correlation with human evaluators' scores.
LLMs as Virtual Doctors: Leveraging the assessment capability, the study evaluates the diagnostic abilities of various LLMs, providing insights into the potential and limitations of using LLMs as virtual doctors in medical consultation.
Overall, the proposed framework highlights the potential of LLMs as VSPs for more efficient and scalable clinical education, while also offering valuable insights into the development of medical LLMs for intelligent diagnosis and treatment.
다른 언어로
소스 콘텐츠 기반
arxiv.org
더 깊은 질문