Duke Institute for Health Innovation, Durham, North Carolina.
Duke University School of Medicine, Durham, North Carolina.
JAMA Netw Open. 2020 Feb 5;3(2):e1920733. doi: 10.1001/jamanetworkopen.2019.20733.
The ability to accurately predict in-hospital mortality for patients at the time of admission could improve clinical and operational decision-making and outcomes. Few of the machine learning models that have been developed to predict in-hospital death are both broadly applicable to all adult patients across a health system and readily implementable. Similarly, few have been implemented, and none have been evaluated prospectively and externally validated.
To prospectively and externally validate a machine learning model that predicts in-hospital mortality for all adult patients at the time of hospital admission and to design the model using commonly available electronic health record data and accessible computational methods.
DESIGN, SETTING, AND PARTICIPANTS: In this prognostic study, electronic health record data from a total of 43 180 hospitalizations representing 31 003 unique adult patients admitted to a quaternary academic hospital (hospital A) from October 1, 2014, to December 31, 2015, formed a training and validation cohort. The model was further validated in additional cohorts spanning from March 1, 2018, to August 31, 2018, using 16 122 hospitalizations representing 13 094 unique adult patients admitted to hospital A, 6586 hospitalizations representing 5613 unique adult patients admitted to hospital B, and 4086 hospitalizations representing 3428 unique adult patients admitted to hospital C. The model was integrated into the production electronic health record system and prospectively validated on a cohort of 5273 hospitalizations representing 4525 unique adult patients admitted to hospital A between February 14, 2019, and April 15, 2019.
The main outcome was in-hospital mortality. Model performance was quantified using the area under the receiver operating characteristic curve and area under the precision recall curve.
A total of 75 247 hospital admissions (median [interquartile range] patient age, 59.5 [29.0] years; 45.9% involving male patients) were included in the study. The in-hospital mortality rates for the training validation; retrospective validations at hospitals A, B, and C; and prospective validation cohorts were 3.0%, 2.7%, 1.8%, 2.1%, and 1.6%, respectively. The area under the receiver operating characteristic curves were 0.87 (95% CI, 0.83-0.89), 0.85 (95% CI, 0.83-0.87), 0.89 (95% CI, 0.86-0.92), 0.84 (95% CI, 0.80-0.89), and 0.86 (95% CI, 0.83-0.90), respectively. The area under the precision recall curves were 0.29 (95% CI, 0.25-0.37), 0.17 (95% CI, 0.13-0.22), 0.22 (95% CI, 0.14-0.31), 0.13 (95% CI, 0.08-0.21), and 0.14 (95% CI, 0.09-0.21), respectively.
Prospective and multisite retrospective evaluations of a machine learning model demonstrated good discrimination of in-hospital mortality for adult patients at the time of admission. The data elements, methods, and patient selection make the model implementable at a system level.
在患者入院时准确预测住院死亡率可以改善临床和运营决策及结果。已经开发出的用于预测住院死亡的少数机器学习模型,要么在整个医疗体系中广泛适用于所有成年患者,要么易于实施。同样,这些模型很少被实施,也没有被前瞻性地评估和外部验证。
前瞻性和外部验证一种用于预测所有成年患者入院时住院死亡率的机器学习模型,并使用常见的电子健康记录数据和可访问的计算方法来设计该模型。
设计、地点和参与者:在这项预后研究中,总共 43180 次住院的电子健康记录数据代表了 31003 名独特的成年患者,这些患者于 2014 年 10 月 1 日至 2015 年 12 月 31 日从一家四级学术医院(医院 A)入院,形成了一个训练和验证队列。该模型在另外三个队列中进一步得到验证,这三个队列分别是 2018 年 3 月 1 日至 2018 年 8 月 31 日期间使用 16122 次住院的电子健康记录数据,代表了 13094 名独特的成年患者从医院 A 入院;6586 次住院的电子健康记录数据,代表了 5613 名独特的成年患者从医院 B 入院;以及 4086 次住院的电子健康记录数据,代表了 3428 名独特的成年患者从医院 C 入院。该模型被集成到生产电子健康记录系统中,并在 2019 年 2 月 14 日至 2019 年 4 月 15 日期间对 5273 次住院的队列进行了前瞻性验证,这些住院代表了 4525 名独特的成年患者从医院 A 入院。
主要结果是住院死亡率。使用接受者操作特征曲线下的面积和精度召回曲线下的面积来量化模型性能。
共纳入 75247 次住院(中位数[四分位间距]患者年龄,59.5[29.0]岁;45.9%的患者为男性)。在训练验证、医院 A、B 和 C 的回顾性验证以及前瞻性验证队列中,住院死亡率分别为 3.0%、2.7%、1.8%、2.1%和 1.6%。接受者操作特征曲线下的面积分别为 0.87(95%CI,0.83-0.89)、0.85(95%CI,0.83-0.87)、0.89(95%CI,0.86-0.92)、0.84(95%CI,0.80-0.89)和 0.86(95%CI,0.83-0.90)。精度召回曲线下的面积分别为 0.29(95%CI,0.25-0.37)、0.17(95%CI,0.13-0.22)、0.22(95%CI,0.14-0.31)、0.13(95%CI,0.08-0.21)和 0.14(95%CI,0.09-0.21)。
对一种机器学习模型的前瞻性和多站点回顾性评估表明,该模型能够很好地区分成年患者入院时的住院死亡率。该模型的数据元素、方法和患者选择使其能够在系统层面上实施。