Suppr超能文献

利用机器学习自动检测传染病诊断错误

Automating detection of diagnostic error of infectious diseases using machine learning.

作者信息

Peterson Kelly S, Chapman Alec B, Widanagamaachchi Wathsala, Sutton Jesse, Ochoa Brennan, Jones Barbara E, Stevens Vanessa, Classen David C, Jones Makoto M

机构信息

Veterans Health Administration, Office of Analytics and Performance Integration, Washington D.C., District of Columbia, United States of America.

Department of Internal Medicine, Division of Epidemiology, University of Utah, Salt Lake City, Utah, United States of America.

出版信息

PLOS Digit Health. 2024 Jun 7;3(6):e0000528. doi: 10.1371/journal.pdig.0000528. eCollection 2024 Jun.

Abstract

Diagnostic error, a cause of substantial morbidity and mortality, is largely discovered and evaluated through self-report and manual review, which is costly and not suitable to real-time intervention. Opportunities exist to leverage electronic health record data for automated detection of potential misdiagnosis, executed at scale and generalized across diseases. We propose a novel automated approach to identifying diagnostic divergence considering both diagnosis and risk of mortality. Our objective was to identify cases of emergency department infectious disease misdiagnoses by measuring the deviation between predicted diagnosis and documented diagnosis, weighted by mortality. Two machine learning models were trained for prediction of infectious disease and mortality using the first 24h of data. Charts were manually reviewed by clinicians to determine whether there could have been a more correct or timely diagnosis. The proposed approach was validated against manual reviews and compared using the Spearman rank correlation. We analyzed 6.5 million ED visits and over 700 million associated clinical features from over one hundred emergency departments. The testing set performances of the infectious disease (Macro F1 = 86.7, AUROC 90.6 to 94.7) and mortality model (Macro F1 = 97.6, AUROC 89.1 to 89.1) were in expected ranges. Human reviews and the proposed automated metric demonstrated positive correlations ranging from 0.231 to 0.358. The proposed approach for diagnostic deviation shows promise as a potential tool for clinicians to find diagnostic errors. Given the vast number of clinical features used in this analysis, further improvements likely need to either take greater account of data structure (what occurs before when) or involve natural language processing. Further work is needed to explain the potential reasons for divergence and to refine and validate the approach for implementation in real-world settings.

摘要

诊断错误是导致大量发病和死亡的原因,目前主要通过自我报告和人工审查来发现和评估,这种方式成本高昂且不适用于实时干预。利用电子健康记录数据进行潜在误诊自动检测的机会是存在的,这种检测可以大规模执行并推广到各种疾病。我们提出了一种新颖的自动方法来识别诊断差异,同时考虑诊断和死亡风险。我们的目标是通过测量预测诊断与记录诊断之间的偏差(以死亡率加权)来识别急诊科传染病误诊病例。使用前24小时的数据训练了两个机器学习模型,分别用于预测传染病和死亡率。临床医生人工审查病历图表,以确定是否可能存在更正确或更及时的诊断。将所提出的方法与人工审查进行验证,并使用斯皮尔曼等级相关性进行比较。我们分析了来自一百多个急诊科的650万次急诊就诊以及超过7亿个相关临床特征。传染病模型(宏F1 = 86.7,曲线下面积90.6至94.7)和死亡率模型(宏F1 = 97.6,曲线下面积89.1至89.1)的测试集性能在预期范围内。人工审查与所提出的自动指标显示出0.231至0.358的正相关性。所提出的诊断偏差方法有望成为临床医生发现诊断错误的潜在工具。鉴于本分析中使用了大量临床特征,进一步的改进可能需要更多地考虑数据结构(事件发生的先后顺序)或涉及自然语言处理。需要进一步开展工作来解释差异的潜在原因,并完善和验证该方法以便在实际环境中实施。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19a6/11161023/559087739d4d/pdig.0000528.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验