Lin Qingqing, Zhao Wenxiang, Zhang Hailin, Chen Wenhao, Lian Sheng, Ruan Qinyun, Qu Zhaoyang, Lin Yimin, Chai Dajun, Lin Xiaoyan
Department of Ultrasound, The First Affiliated Hospital, Fujian Medical University, Fuzhou, China.
National Regional Medical Center, Binhai Branch of the First Affiliated Hospital, Fujian Medical University, Fuzhou, China.
Front Cardiovasc Med. 2025 Jan 24;12:1444323. doi: 10.3389/fcvm.2025.1444323. eCollection 2025.
Early prediction of heart failure (HF) after acute myocardial infarction (AMI) is essential for personalized treatment. We aimed to use interpretable machine learning (ML) methods to develop a risk prediction model for HF in AMI patients.
We retrospectively included patients initially with AMI who received percutaneous coronary intervention (PCI) in our hospital from November 2016 to February 2020. The primary endpoint was the occurrence of HF within 3 years after operation. For developing a predictive model for HF risk in AMI patients, the least absolute shrinkage and selection operator (LASSO) Regression was used to feature selection, and four ML algorithms including Random Forest (RF), Extreme Gradient Boost (XGBoost), Support Vector Machine (SVM), and Logistic Regression (LR) were employed to develop the model on the training set. The performance evaluation of the prediction model was carried out on the training set and the testing set, utilizing metrics including AUC (Area under the receiver operating characteristic curve), calibration plot, and decision curve analysis (DCA). In addition, we used the Shapley Additive Explanations (SHAP) value to determine the importance of the selected features and interpret the optimal model.
A total of 1220 AMI patients were included and 244 (20%) patients developed HF during follow-up. Among the four evaluated ML models, the XGBoost model exhibited exceptional accuracy, with an AUC value of 0.922. The SHAP method showed that left ventricular ejection fraction (LVEF), left ventricular end-systolic diameter (LVDs) and lactate dehydrogenase (LDH) were identified as the three most important characteristics to predict HF risk in AMI patients. Individual risk assessment was performed using SHAP plots and waterfall plot analysis.
Our research demonstrates the potential of ML methods in the early prediction of HF risk in AMI patients. Furthermore, it enhances the interpretability of the XGBoost model through SHAP analysis to guide clinical decision-making.
急性心肌梗死(AMI)后心力衰竭(HF)的早期预测对于个性化治疗至关重要。我们旨在使用可解释的机器学习(ML)方法来开发AMI患者HF的风险预测模型。
我们回顾性纳入了2016年11月至2020年2月在我院接受经皮冠状动脉介入治疗(PCI)的初发AMI患者。主要终点是术后3年内发生HF。为了建立AMI患者HF风险的预测模型,使用最小绝对收缩和选择算子(LASSO)回归进行特征选择,并采用包括随机森林(RF)、极端梯度提升(XGBoost)、支持向量机(SVM)和逻辑回归(LR)在内的四种ML算法在训练集上建立模型。利用受试者操作特征曲线下面积(AUC)、校准图和决策曲线分析(DCA)等指标在训练集和测试集上对预测模型进行性能评估。此外,我们使用Shapley加性解释(SHAP)值来确定所选特征的重要性并解释最佳模型。
共纳入1220例AMI患者,244例(20%)患者在随访期间发生HF。在四个评估的ML模型中,XGBoost模型表现出卓越的准确性,AUC值为0.922。SHAP方法显示左心室射血分数(LVEF)、左心室收缩末期内径(LVDs)和乳酸脱氢酶(LDH)被确定为预测AMI患者HF风险的三个最重要特征。使用SHAP图和瀑布图分析进行个体风险评估。
我们的研究证明了ML方法在AMI患者HF风险早期预测中的潜力。此外,它通过SHAP分析增强了XGBoost模型的可解释性,以指导临床决策。