

PREDICTIVE MODELING OF HOSPITAL READMISSION RATES USING ELECTRONIC MEDICAL RECORD-WIDE MACHINE LEARNING: A CASE-STUDY USING MOUNT SINAI HEART FAILURE COHORT.

Authors

Shameer Khader, Johnson Kipp W, Yahi Alexandre, Miotto Riccardo, Li L I, Ricks Doran, Jebakaran Jebakumar, Kovatch Patricia, Sengupta Partho P, Gelijns Sengupta, Moskovitz Alan, Darrow Bruce, David David L, Kasarskis Andrew, Tatonetti Nicholas P, Pinney Sean, Dudley Joel T

Affiliations

Department of Genetics and Genomics, Icahn Institute of Genomics and Multiscale Biology, New York, NY, USA; Institute of Next Generation Healthcare, Mount Sinai Health System, New York, NY, USA.

Publication information

Pac Symp Biocomput. 2017;22:276-287. doi: 10.1142/9789813207813_0027.

Abstract

Reducing preventable hospital readmissions that result from chronic or acute conditions such as stroke, heart failure, myocardial infarction and pneumonia remains a significant challenge for improving outcomes and decreasing the cost of healthcare delivery in the United States. Readmission rates remain relatively high for conditions like heart failure (HF) despite the implementation of high-quality healthcare delivery operation guidelines created by regulatory authorities. Multiple predictive models are currently available to evaluate potential 30-day readmission rates of patients. Most of these models are hypothesis driven and repeatedly assess the predictive abilities of the same set of biomarkers as predictive features. In this manuscript, we discuss our attempt to develop a data-driven, electronic medical record-wide (EMR-wide) feature selection approach and subsequent machine learning to predict readmission probabilities. We assessed a large repertoire of variables from the electronic medical records of heart failure patients at a single center. The cohort included 1,068 patients, of whom 178 were readmitted within a 30-day interval (16.66% readmission rate). A total of 4,205 variables were extracted from the EMR, including diagnosis codes (n=1,763), medications (n=1,028), laboratory measurements (n=846), surgical procedures (n=564) and vital signs (n=4). We designed a multistep modeling strategy using the Naïve Bayes algorithm. In the first step, we created individual models to classify the cases (readmitted) and controls (non-readmitted). In the second step, features contributing to predictive risk from independent models were combined into a composite model using a correlation-based feature selection (CFS) method. All models were trained and tested using a 5-fold cross-validation method, with 70% of the cohort used for training and the remaining 30% for testing. Compared to existing predictive models for HF readmission rates (AUCs in the range of 0.6-0.7), results from our EMR-wide predictive model (AUC=0.78; Accuracy=83.19%) and phenome-wide feature selection strategies are encouraging and reveal the utility of such data-driven machine learning. Fine tuning of the model, replication using multi-center cohorts and a prospective clinical trial to evaluate the clinical utility would help the adoption of the model as a clinical decision system for evaluating readmission status.
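
The evaluation loop described in the abstract (a Naïve Bayes classifier scored by AUC under 5-fold cross-validation) can be illustrated with a minimal sketch. This is not the authors' pipeline. It assumes binary features, a Bernoulli variant of Naïve Bayes with Laplace smoothing, and stratified folds, none of which the abstract specifies. The data are synthetic, and only the cohort size (1,068) and case count (178) are taken from the abstract. The CFS step and the 70%/30% split are not reproduced, and every function name is hypothetical.

```python
import math
import random

def train_bernoulli_nb(X, y, alpha=1.0):
    """Fit Bernoulli Naive Bayes with Laplace smoothing (assumed variant)."""
    n_features = len(X[0])
    model = {}
    for label in (0, 1):
        rows = [x for x, t in zip(X, y) if t == label]
        prior = len(rows) / len(X)
        # P(feature_j = 1 | label), smoothed so it is never exactly 0 or 1
        probs = [(sum(r[j] for r in rows) + alpha) / (len(rows) + 2 * alpha)
                 for j in range(n_features)]
        model[label] = (math.log(prior), probs)
    return model

def predict_proba(model, x):
    """Return P(readmitted | x) from the per-class log-joint."""
    log_joint = {}
    for label, (log_prior, probs) in model.items():
        log_joint[label] = log_prior + sum(
            math.log(p) if v else math.log(1.0 - p) for v, p in zip(x, probs))
    m = max(log_joint.values())  # subtract the max for numerical stability
    e0 = math.exp(log_joint[0] - m)
    e1 = math.exp(log_joint[1] - m)
    return e1 / (e0 + e1)

def auc(scores, labels):
    """AUC as the chance a case outranks a control (ties count 0.5)."""
    pos = [s for s, t in zip(scores, labels) if t == 1]
    neg = [s for s, t in zip(scores, labels) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def stratified_folds(y, k, rng):
    """Split indices into k folds, preserving the case/control ratio."""
    folds = [[] for _ in range(k)]
    for label in (0, 1):
        idx = [i for i, t in enumerate(y) if t == label]
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % k].append(i)
    return folds

def make_synthetic_cohort(n_cases, n_controls, n_features, n_informative, rng):
    """Synthetic binary features; the first n_informative differ by class."""
    X, y = [], []
    for label, count in ((1, n_cases), (0, n_controls)):
        for _ in range(count):
            row = []
            for j in range(n_features):
                p = (0.6 if label else 0.3) if j < n_informative else 0.2
                row.append(1 if rng.random() < p else 0)
            X.append(row)
            y.append(label)
    return X, y

rng = random.Random(0)
# Cohort size and case count from the abstract; feature values are made up.
X, y = make_synthetic_cohort(178, 1068 - 178, 30, 5, rng)
folds = stratified_folds(y, 5, rng)
fold_aucs = []
for held_out in folds:
    test_idx = set(held_out)
    train_X = [X[i] for i in range(len(X)) if i not in test_idx]
    train_y = [y[i] for i in range(len(y)) if i not in test_idx]
    model = train_bernoulli_nb(train_X, train_y)
    scores = [predict_proba(model, X[i]) for i in held_out]
    fold_aucs.append(auc(scores, [y[i] for i in held_out]))
mean_auc = sum(fold_aucs) / len(fold_aucs)
print(f"mean 5-fold AUC on synthetic data: {mean_auc:.3f}")
```

Because the features are simulated, the printed AUC says nothing about the reported result of 0.78. The sketch only shows how held-out predictions are scored fold by fold and then averaged.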


