Wu J Y, Lin Y, Lin K, Hu Y H, Kong G L
Department of Epidemiology and Biostatistics, Peking University School of Public Health, Beijing 100191, China.
Advanced Institute of Information Technology, Peking University, Hangzhou 311200, China.
Beijing Da Xue Xue Bao Yi Xue Ban. 2021 Dec 18;53(6):1163-1170. doi: 10.19723/j.issn.1671-167X.2021.06.026.
To construct length of intensive care unit (ICU) stay (LOS-ICU) prediction models for ICU patients, based on three machine learning models support vector machine (SVM), classification and regression tree (CART), and random forest (RF), and to compare the prediction perfor-mance of the three machine learning models with the customized simplified acute physiology score Ⅱ(SAPS-Ⅱ) model.
We used medical information mart for intensive care (MIMIC)-Ⅲ database for model development and validation. The primary outcome was prolonged LOS-ICU(pLOS-ICU), defined as longer than the third quartile of patients' LOS-ICU in the studied dataset. The recursive feature elimination method was used to do feature selection for three machine learning models. We utilized 5-fold cross validation to evaluate model prediction performance. The Brier value, area under the receiver operation characteristic curve (AUROC), and estimated calibration index (ECI) were used as perfor-mance measures. Performances of the four models were compared, and performance differences between the models were assessed using two-sided test. The model with the best prediction performance was employed to generate variable importance ranking, and the identified top five important predictors were pre-sented.
The final cohort in our study consisted of 40 200 eligible ICU patients, of whom 23.7% were with pLOS-ICU. The proportion of the male patients was 57.6%, and the age of all the ICU patients was (61.9±16.5) years.Results showed that the three machine learning models outperformed the customized SAPS-Ⅱ model in terms of all the performance measures with statistical significance ( < 0.01). Among the three machine learning models, the RF model achieved the best overall performance (Brier value, 0.145), discrimination (AUROC, 0.770) and calibration (ECI, 7.259). The calibration curve showed that the RF model slightly overestimated the risk of pLOS-ICU in high-risk ICU patients, but underestimated the risk of pLOS-ICU in low-risk ICU patients. Top five important predictors for pLOS-ICU identified by the RF model included age, heart rate, systolic blood pressure, body tempe-rature, and ratio of arterial oxygen tension to the fraction of inspired oxygen(PaO/FiO).
The RF algorithm-based pLOS-ICU prediction model had a best prediction performance in this study. It lays a foundation for future application of the RF-based pLOS-ICU prediction model in ICU clinical practice.
基于支持向量机(SVM)、分类与回归树(CART)和随机森林(RF)这三种机器学习模型,构建重症监护病房(ICU)患者的住院时长(LOS - ICU)预测模型,并将这三种机器学习模型的预测性能与定制的简化急性生理学评分Ⅱ(SAPS - Ⅱ)模型进行比较。
我们使用重症监护医学信息集市(MIMIC)-Ⅲ数据库进行模型开发和验证。主要结局为延长的ICU住院时长(pLOS - ICU),定义为研究数据集中患者LOS - ICU超过第三个四分位数。采用递归特征消除方法对三种机器学习模型进行特征选择。我们利用5折交叉验证来评估模型预测性能。使用Brier值、受试者操作特征曲线下面积(AUROC)和估计校准指数(ECI)作为性能指标。比较四种模型的性能,并使用双侧检验评估模型之间的性能差异。采用预测性能最佳的模型生成变量重要性排名,并列出确定的前五个重要预测因素。
我们研究中的最终队列包括402名符合条件的ICU患者,其中23.7%患有pLOS - ICU。男性患者比例为57.6%,所有ICU患者的年龄为(61.9±16.5)岁。结果表明,在所有性能指标方面,三种机器学习模型均优于定制的SAPS - Ⅱ模型,差异具有统计学意义(P<0.01)。在三种机器学习模型中,RF模型的总体性能最佳(Brier值为0.145)、区分度(AUROC为0.770)和校准度(ECI为7.259)。校准曲线显示,RF模型在高风险ICU患者中略微高估了pLOS - ICU的风险,但在低风险ICU患者中低估了pLOS - ICU的风险。RF模型确定的pLOS - ICU的前五个重要预测因素包括年龄、心率、收缩压、体温以及动脉血氧分压与吸入氧分数之比(PaO₂/FiO₂)。
基于RF算法的pLOS - ICU预测模型在本研究中具有最佳的预测性能。它为未来基于RF的pLOS - ICU预测模型在ICU临床实践中的应用奠定了基础。