Suppr超能文献

使用电子健康记录数据的自然语言处理验证危重病预后预测模型。

Validation of Prediction Models for Critical Care Outcomes Using Natural Language Processing of Electronic Health Record Data.

机构信息

Philip R. Lee Institute for Health Policy Studies, School of Medicine, University of California, San Francisco.

Center for Healthcare Value, University of California, San Francisco.

出版信息

JAMA Netw Open. 2018 Dec 7;1(8):e185097. doi: 10.1001/jamanetworkopen.2018.5097.

Abstract

IMPORTANCE

Accurate prediction of outcomes among patients in intensive care units (ICUs) is important for clinical research and monitoring care quality. Most existing prediction models do not take full advantage of the electronic health record, using only the single worst value of laboratory tests and vital signs and largely ignoring information present in free-text notes. Whether capturing more of the available data and applying machine learning and natural language processing (NLP) can improve and automate the prediction of outcomes among patients in the ICU remains unknown.

OBJECTIVES

To evaluate the change in power for a mortality prediction model among patients in the ICU achieved by incorporating measures of clinical trajectory together with NLP of clinical text and to assess the generalizability of this approach.

DESIGN, SETTING, AND PARTICIPANTS: This retrospective cohort study included 101 196 patients with a first-time admission to the ICU and a length of stay of at least 4 hours. Twenty ICUs at 2 academic medical centers (University of California, San Francisco [UCSF], and Beth Israel Deaconess Medical Center [BIDMC], Boston, Massachusetts) and 1 community hospital (Mills-Peninsula Medical Center [MPMC], Burlingame, California) contributed data from January 1, 2001, through June 1, 2017. Data were analyzed from July 1, 2017, through August 1, 2018.

MAIN OUTCOMES AND MEASURES

In-hospital mortality and model discrimination as assessed by the area under the receiver operating characteristic curve (AUC) and model calibration as assessed by the modified Hosmer-Lemeshow statistic.

RESULTS

Among 101 196 patients included in the analysis, 51.3% (n = 51 899) were male, with a mean (SD) age of 61.3 (17.1) years; their in-hospital mortality rate was 10.4% (n = 10 505). A baseline model using only the highest and lowest observed values for each laboratory test result or vital sign achieved a cross-validated AUC of 0.831 (95% CI, 0.830-0.832). In contrast, that model augmented with measures of clinical trajectory achieved an AUC of 0.899 (95% CI, 0.896-0.902; P < .001 for AUC difference). Further augmenting this model with NLP-derived terms associated with mortality further increased the AUC to 0.922 (95% CI, 0.916-0.924; P < .001). These NLP-derived terms were associated with improved model performance even when applied across sites (AUC difference for UCSF: 0.077 to 0.021; AUC difference for MPMC: 0.071 to 0.051; AUC difference for BIDMC: 0.035 to 0.043; P < .001) when augmenting with NLP at each site.

CONCLUSIONS AND RELEVANCE

Intensive care unit mortality prediction models incorporating measures of clinical trajectory and NLP-derived terms yielded excellent predictive performance and generalized well in this sample of hospitals. The role of these automated algorithms, particularly those using unstructured data from notes and other sources, in clinical research and quality improvement seems to merit additional investigation.

摘要

重要性

准确预测重症监护病房(ICU)患者的结局对于临床研究和监测护理质量非常重要。大多数现有的预测模型并没有充分利用电子健康记录,仅使用实验室检测和生命体征的单个最差值,并且在很大程度上忽略了自由文本记录中存在的信息。利用更多的可用数据并应用机器学习和自然语言处理(NLP)是否可以改善和自动化 ICU 患者的预后预测结果仍不清楚。

目的

评估通过整合临床轨迹测量值和临床文本的 NLP 来提高 ICU 患者死亡率预测模型的能力,并评估该方法的泛化能力。

设计、设置和参与者:这项回顾性队列研究纳入了 2001 年 1 月 1 日至 2017 年 6 月 1 日期间首次入住 ICU 且入住时间至少 4 小时的 101196 名患者。来自加利福尼亚大学旧金山分校(UCSF)和贝斯以色列女执事医疗中心(BIDMC,马萨诸塞州波士顿)的 20 个 ICU 和加利福尼亚州米尔皮塔斯医疗中心(MPMC,伯林盖姆)提供了数据。数据分析于 2017 年 7 月 1 日至 2018 年 8 月 1 日进行。

主要结局和测量指标

院内死亡率和接受者操作特征曲线(ROC)下面积(AUC)评估的模型区分度,以及改良 Hosmer-Lemeshow 统计量评估的模型校准度。

结果

在纳入分析的 101196 名患者中,51.3%(n=51899)为男性,平均(SD)年龄为 61.3(17.1)岁;他们的院内死亡率为 10.4%(n=10505)。仅使用每个实验室检测结果或生命体征的最高和最低观察值的基线模型,经交叉验证后的 AUC 为 0.831(95%CI,0.830-0.832)。相比之下,通过增加临床轨迹测量值来增强该模型,AUC 可达到 0.899(95%CI,0.896-0.902;P<0.001)。进一步通过 NLP 分析与死亡率相关的术语来增强该模型,可将 AUC 提高至 0.922(95%CI,0.916-0.924;P<0.001)。即使在每个站点应用 NLP 时,这些 NLP 分析的术语也与改善模型性能相关(UCSF 的 AUC 差异:0.077 至 0.021;MPMC 的 AUC 差异:0.071 至 0.051;BIDMC 的 AUC 差异:0.035 至 0.043;P<0.001)。

结论和相关性

纳入临床轨迹和 NLP 分析术语的 ICU 死亡率预测模型表现出出色的预测性能,并且在该组医院中具有良好的泛化能力。这些自动化算法,尤其是那些使用来自记录和其他来源的非结构化数据的算法,在临床研究和质量改进中的作用似乎值得进一步研究。

相似文献

引用本文的文献

本文引用的文献

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验