Department of General Internal Medicine, Nagano Chuo Hospital, Nagano 380-0814, Japan.
Department of Diagnostic and Generalist Medicine, Dokkyo Medical University, Tochigi 321-0293, Japan.
Int J Environ Res Public Health. 2021 Feb 21;18(4):2086. doi: 10.3390/ijerph18042086.
The efficacy of artificial intelligence (AI)-driven automated medical-history-taking systems with AI-driven differential-diagnosis lists on physicians' diagnostic accuracy was shown. However, considering the negative effects of AI-driven differential-diagnosis lists such as omission (physicians reject a correct diagnosis suggested by AI) and commission (physicians accept an incorrect diagnosis suggested by AI) errors, the efficacy of AI-driven automated medical-history-taking systems without AI-driven differential-diagnosis lists on physicians' diagnostic accuracy should be evaluated.
The present study was conducted to evaluate the efficacy of AI-driven automated medical-history-taking systems with or without AI-driven differential-diagnosis lists on physicians' diagnostic accuracy.
This randomized controlled study was conducted in January 2021 and included 22 physicians working at a university hospital. Participants were required to read 16 clinical vignettes in which the AI-driven medical history of real patients generated up to three differential diagnoses per case. Participants were divided into two groups: with and without an AI-driven differential-diagnosis list.
There was no significant difference in diagnostic accuracy between the two groups (57.4% vs. 56.3%, respectively; = 0.91). Vignettes that included a correct diagnosis in the AI-generated list showed the greatest positive effect on physicians' diagnostic accuracy (adjusted odds ratio 7.68; 95% CI 4.68-12.58; < 0.001). In the group with AI-driven differential-diagnosis lists, 15.9% of diagnoses were omission errors and 14.8% were commission errors.
Physicians' diagnostic accuracy using AI-driven automated medical history did not differ between the groups with and without AI-driven differential-diagnosis lists.
已经证明了人工智能(AI)驱动的自动化病历采集系统与 AI 驱动的鉴别诊断列表结合使用对医生诊断准确性的有效性。然而,考虑到 AI 驱动的鉴别诊断列表可能会产生漏诊(医生拒绝 AI 建议的正确诊断)和误诊(医生接受 AI 建议的错误诊断)等负面影响,因此需要评估没有 AI 驱动的鉴别诊断列表的 AI 驱动的自动化病历采集系统对医生诊断准确性的效果。
本研究旨在评估具有或不具有 AI 驱动的鉴别诊断列表的 AI 驱动的自动化病历采集系统对医生诊断准确性的效果。
这是一项 2021 年 1 月进行的随机对照研究,纳入了一家大学医院的 22 名医生。参与者需要阅读 16 个临床案例,其中每个案例的 AI 驱动的真实患者病历都会生成最多三个鉴别诊断。参与者被分为两组:有和没有 AI 驱动的鉴别诊断列表。
两组的诊断准确性没有显著差异(分别为 57.4%和 56.3%, = 0.91)。AI 生成列表中包含正确诊断的案例对医生的诊断准确性有最大的积极影响(调整后的优势比 7.68;95%CI 4.68-12.58; < 0.001)。在有 AI 驱动的鉴别诊断列表的组中,15.9%的诊断为漏诊,14.8%为误诊。
在有和没有 AI 驱动的鉴别诊断列表的组中,医生使用 AI 驱动的自动化病历的诊断准确性没有差异。