Xu Kai, Shan Lingtong, Bai Yun, Shi Yu, Lv Mengwei, Li Wei, Dai Huangdong, Zhang Xiaobin, Wang Zhenhua, Li Zhi, Li Mingliang, Zhao Xin, Zhang Yangyang
Department of Cardiovascular Surgery, Qilu Hospital of Shandong University, Jinan, Shandong, PR China.
Institute of Thoracoscopy in Cardiac Surgery, Qilu Hospital of Shandong University, Jinan, Shandong, PR China.
Heliyon. 2024 Sep 26;10(19):e38435. doi: 10.1016/j.heliyon.2024.e38435. eCollection 2024 Oct 15.
Machine learning (ML) has excelled after being introduced into the medical field. Ensemble ML models were able to integrate the advantages of several different ML models. This study compares the ensemble ML model's and EuroSCORE II's performance predicting in-hospital mortality in patients undergoing coronary artery bypass grafting surgery.
The study included 4,764 patients from three heart centers between January 2007 and December 2021. Of these, 3812 patients were assigned to the modeling group, and 952 patients were assigned to the internal test group. Patients from other two heart center (1733 and 415 cases, respectively) constituted the external test group. The modeling set data are trained using each of the three ML strategies (XGBoost, CatBoost, and LightGBM), and the new model (XCL model) is constructed by integrating these three models through an ensemble ML strategy. Performance of different models in the three test groups comparative assessments were performed by calibration, discriminant, decision curve analysis, net reclassification index (NRI), integrated discriminant improvement (IDI), and Bland-Altman analysis.
In terms of discrimination, the XCL model performed the best with an impressive AUC value of 0.9145 in the internal validation group. The XCL model continued to perform best in both external test groups. The NRI and IDI suggested that the ML model showed positive improvements in all three test groups compared to EuroSCORE II.
ML models, particularly the XCL model, outperformed EuroSCORE II in predicting in-hospital mortality for CABG patients, with better discrimination, calibration, and clinical utility.
机器学习(ML)被引入医学领域后表现出色。集成机器学习模型能够整合几种不同机器学习模型的优势。本研究比较了集成机器学习模型和欧洲心脏手术风险评估系统II(EuroSCORE II)在预测冠状动脉搭桥手术患者院内死亡率方面的性能。
该研究纳入了2007年1月至2021年12月期间来自三个心脏中心的4764例患者。其中,3812例患者被分配到建模组,952例患者被分配到内部测试组。来自其他两个心脏中心的患者(分别为1733例和415例)构成外部测试组。使用三种机器学习策略(XGBoost、CatBoost和LightGBM)分别对建模集数据进行训练,并通过集成机器学习策略将这三个模型整合构建新模型(XCL模型)。通过校准、判别、决策曲线分析、净重新分类指数(NRI)、综合判别改善(IDI)和Bland-Altman分析对三个测试组中不同模型的性能进行比较评估。
在判别方面,XCL模型表现最佳,在内部验证组中的AUC值令人印象深刻,为0.9145。XCL模型在两个外部测试组中也继续表现最佳。NRI和IDI表明,与EuroSCORE II相比,机器学习模型在所有三个测试组中均显示出积极的改善。
机器学习模型,尤其是XCL模型,在预测冠状动脉搭桥手术患者的院内死亡率方面优于EuroSCORE II,具有更好的判别能力、校准能力和临床实用性。