Saad Odelyah, Saban Mor, Kerner Erika, Levin Chedva
Jerusalem College of Technology, Jerusalem, Israel.
Ariel University, Ariel, Israel.
J Prim Care Community Health. 2025 Jan-Dec;16:21501319251326663. doi: 10.1177/21501319251326663. Epub 2025 Mar 25.
To compare the diagnostic accuracy and clinical decision-making of experienced community nurses versus state-of-the-art generative AI (GenAI) systems for simulated patient case scenarios.
In the months of 5 to 6/2024, 114 community Israeli nurses completed a questionnaire including 4 medical case studies. Responses were also collected from 3 GenAI models (ChatGPT-4, Claude 3.0, and Gemini 1.5), analyzed both without word limits and with a 10-word constraint. Responses were scored on accuracy, speed, and comprehensiveness.
Nurses scored higher on average compared to the shortened GenAI responses. GenAI responses were faster but more verbose, and contained unnecessary information. Gemini (full version) and Claude (full version) achieved the highest accuracy among the GenAI models.
While GenAI shows potential to support aspects of nursing practice, human clinicians currently exhibit advantages in holistic clinical reasoning abilities, a skill requiring experience, contextual knowledge, and ability to bring concise and practical responses. Further research is needed before GenAI can adequately substitute nursing expertise.
比较经验丰富的社区护士与最先进的生成式人工智能(GenAI)系统在模拟患者病例场景中的诊断准确性和临床决策能力。
在2024年5月至6月期间,114名以色列社区护士完成了一份包含4个医学案例研究的问卷。还从3个GenAI模型(ChatGPT-4、Claude 3.0和Gemini 1.5)收集了回复,分别在无字数限制和10字限制的情况下进行分析。根据准确性、速度和全面性对回复进行评分。
与缩短后的GenAI回复相比,护士的平均得分更高。GenAI回复速度更快但更冗长,包含不必要的信息。在GenAI模型中,Gemini(完整版)和Claude(完整版)的准确性最高。
虽然GenAI显示出支持护理实践某些方面的潜力,但目前人类临床医生在整体临床推理能力方面具有优势,这是一种需要经验、情境知识以及给出简洁实用回复能力的技能。在GenAI能够充分替代护理专业知识之前,还需要进一步研究。