Department of Cardiology, Peking University China-Japan Friendship School of Clinical Medicine, Beijing, 100029, China.
Department of Cardiology, China-Japan Friendship Hospital, 2 Yinghua Dongjie, Chaoyang District, Beijing, 100029, China.
Eur J Med Res. 2023 Jan 18;28(1):33. doi: 10.1186/s40001-023-00995-x.
Chronic kidney disease (CKD) patients with coronary artery disease (CAD) in the intensive care unit (ICU) have higher in-hospital mortality and poorer prognosis than patients with either single condition. The objective of this study is to develop a novel model that can predict the in-hospital mortality of that kind of patient in the ICU using machine learning methods.
Data of CKD patients with CAD were extracted from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database. Boruta algorithm was conducted for the feature selection process. Eight machine learning algorithms, such as logistic regression (LR), random forest (RF), Decision Tree, K-nearest neighbors (KNN), Gradient Boosting Decision Tree Machine (GBDT), Support Vector Machine (SVM), Neural Network (NN), and Extreme Gradient Boosting (XGBoost), were conducted to construct the predictive model for in-hospital mortality and performance was evaluated by average precision (AP) and area under the receiver operating characteristic curve (AUC). Shapley Additive Explanations (SHAP) algorithm was applied to explain the model visually. Moreover, data from the Telehealth Intensive Care Unit Collaborative Research Database (eICU-CRD) were acquired as an external validation set.
3590 and 1657 CKD patients with CAD were acquired from MIMIC-IV and eICU-CRD databases, respectively. A total of 78 variables were selected for the machine learning model development process. Comparatively, GBDT had the highest predictive performance according to the results of AUC (0.946) and AP (0.778). The SHAP method reveals the top 20 factors based on the importance ranking. In addition, GBDT had good predictive value and a certain degree of clinical value in the external validation according to the AUC (0.865), AP (0.672), decision curve analysis, and calibration curve.
Machine learning algorithms, especially GBDT, can be reliable tools for accurately predicting the in-hospital mortality risk for CKD patients with CAD in the ICU. This contributed to providing optimal resource allocation and reducing in-hospital mortality by tailoring precise management and implementation of early interventions.
重症监护病房(ICU)中合并冠心病(CAD)的慢性肾脏病(CKD)患者的院内死亡率和预后较单一疾病患者更高。本研究旨在使用机器学习方法为该类患者建立一种预测 ICU 内院内死亡率的新模型。
从医疗信息重症监护 IV 数据库(MIMIC-IV)中提取 CKD 合并 CAD 患者的数据。使用 Boruta 算法进行特征选择。采用逻辑回归(LR)、随机森林(RF)、决策树、K 最近邻(KNN)、梯度提升决策树机(GBDT)、支持向量机(SVM)、神经网络(NN)和极端梯度提升(XGBoost)等 8 种机器学习算法构建预测模型,并通过平均精度(AP)和接受者操作特征曲线下面积(AUC)评估性能。应用 Shapley 加法解释(SHAP)算法对模型进行可视化解释。此外,还获取了 Telehealth ICU 协作研究数据库(eICU-CRD)的数据作为外部验证集。
从 MIMIC-IV 和 eICU-CRD 数据库中分别获得 3590 例和 1657 例 CKD 合并 CAD 患者。共选择了 78 个变量进行机器学习模型开发过程。根据 AUC(0.946)和 AP(0.778)的结果,GBDT 的预测性能最高。SHAP 方法根据重要性排名显示前 20 个因素。此外,根据 AUC(0.865)、AP(0.672)、决策曲线分析和校准曲线,GBDT 在外部验证中具有良好的预测价值和一定的临床价值。
机器学习算法,特别是 GBDT,可作为准确预测 ICU 中 CKD 合并 CAD 患者院内死亡率风险的可靠工具。这有助于通过制定精确的管理策略和实施早期干预措施,为优化资源分配和降低院内死亡率提供参考。