Song Pan, Liu Xinjun, Wang Liang, Tang Lu, Li Jing, Chen Qin, Liu Xiaoyu, Quan Xiaoyan, Niu Yuxin, Cui Chi, Shi Meihong
School of Nursing, Southwest Medical University, Luzhou, Sichuan Province, China.
Department of Vascular Surgery, Hospital of Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan Province, China.
J Vasc Surg. 2025 May 21. doi: 10.1016/j.jvs.2025.05.022.
Major adverse cardiovascular events (MACE) are severe complications of peripheral arterial disease (PAD), associated with a poor prognosis and disease burden. Therefore, the early identification of high-risk individuals is of paramount importance. This study aimed to develop and validate an interpretable machine learning (ML)-based prediction model for MACE risk in patients with PAD.
This retrospective study included patients with PAD enrolled between January 2022 and December 2023, with follow-up had been completed by December 2024. The primary outcome was MACE, defined as a composite of myocardial infarction, stroke, and cardiovascular mortality, and patients were followed up for 12-24 months using nonoverlapping datasets from four centers: three for model training and internal validation and one for external validation. Feature selection was performed using univariate analysis, least absolute shrinkage and selection operator logistic regression, and a random Forest algorithm. Ten different ML algorithms were used to construct the risk prediction model. Model performance was evaluated based on discrimination and calibration. The SHapley Additive exPlanations method was used to visualize model features and individual case predictions. The final risk prediction model was presented as a web-based calculator.
This multicenter study involved both model development dataset (n = 1110) and external validation dataset (n = 448). Among the 1558 enrolled patients with PAD, 469 of 1558 patients (30.1%) experienced MACE. The incidence of MACE was higher in the training cohort (249/777 [32.0%]) compared with the internal validation cohort (102/333 [30.6%]) and external validation cohort (118/448 [26.3%]). The mean follow-up duration was 19.0 ± 11.3 months. Participants' mean age was 73.1 ± 10.8 years, with males comprising 70.0% of the patients (1091/1558). We developed ML models incorporating eight clinically significant variables, with Gradient Boosting demonstrating comparatively better performance by achieving area under the receiver operating characteristic curve values of 0.864 (95% confidence interval, 0.822-0.905) in internal validation cohort and 0.777 (95% confidence interval, 0.720-0.833) in external validation cohort. The key predictors included polyvascular disease, cerebrovascular disease, hemoglobin A1c, C-reactive protein, albumin, peripheral arterial surgery, coronary heart disease, and neutrophils.
The Gradient Boosting algorithm outperformed other models in predicting MACE risk in patients with PAD, with external validation confirming its clinical applicability. The SHapley Additive exPlanations framework and web-based calculator enhanced the model's interpretability, enabling clinicians to better understand the factors contributing to MACE. This tool potentially helps clinicians to identify MACE risk in patients with PAD and implement preventive measures more effectively.
主要不良心血管事件(MACE)是外周动脉疾病(PAD)的严重并发症,与不良预后和疾病负担相关。因此,早期识别高危个体至关重要。本研究旨在开发并验证一种基于可解释机器学习(ML)的PAD患者MACE风险预测模型。
这项回顾性研究纳入了2022年1月至2023年12月期间登记的PAD患者,截至2024年12月已完成随访。主要结局为MACE,定义为心肌梗死、中风和心血管死亡的综合结果,使用来自四个中心的非重叠数据集对患者进行12至24个月的随访:三个用于模型训练和内部验证,一个用于外部验证。使用单变量分析、最小绝对收缩和选择算子逻辑回归以及随机森林算法进行特征选择。使用十种不同的ML算法构建风险预测模型。基于区分度和校准评估模型性能。使用SHapley加法解释方法可视化模型特征和个体病例预测。最终的风险预测模型以基于网络的计算器形式呈现。
这项多中心研究涉及模型开发数据集(n = 1110)和外部验证数据集(n = 448)。在1558例登记的PAD患者中,1558例患者中有469例(30.1%)发生了MACE。与内部验证队列(102/333 [30.6%])和外部验证队列(118/448 [26.3%])相比,训练队列中MACE的发生率更高(249/777 [32.0%])。平均随访时间为19.0±11.3个月。参与者的平均年龄为73.1±10.8岁,男性占患者的70.0%(1091/1558)。我们开发了包含八个具有临床意义变量的ML模型,梯度提升在内部验证队列中实现了受试者操作特征曲线下面积值为0.864(95%置信区间,0.822 - 0.905),在外部验证队列中为0.777(95%置信区间,0.720 - 0.833),表现相对更好。关键预测因素包括多血管疾病、脑血管疾病、糖化血红蛋白、C反应蛋白、白蛋白、外周动脉手术、冠心病和中性粒细胞。
梯度提升算法在预测PAD患者的MACE风险方面优于其他模型,外部验证证实了其临床适用性。SHapley加法解释框架和基于网络的计算器增强了模型的可解释性,使临床医生能够更好地理解导致MACE的因素。该工具可能有助于临床医生识别PAD患者的MACE风险并更有效地实施预防措施。