Li Ben, Shaikh Farah, Younes Houssam, Abuhalimeh Batool, Zamzam Abdelrahman, Abdin Rawand, Qadura Mohammad
Division of Vascular Surgery, St. Michael's Hospital, Unity Health Toronto, University of Toronto, Toronto, ON M5B 1W8, Canada.
Department of Surgery, University of Toronto, Toronto, ON M5S 1A1, Canada.
Biomolecules. 2025 Jul 11;15(7):991. doi: 10.3390/biom15070991.
Peripheral artery disease (PAD) is associated with an elevated risk of major adverse cardiovascular events (MACE). Despite this, few reliable biomarkers exist to identify patients at heightened risk of MACE. Growth differentiation factor 15 (GDF15), a stress-responsive cytokine implicated in inflammation, atherosclerosis, and thrombosis, has been broadly studied in cardiovascular disease but remains underexplored in PAD. This study aimed to evaluate the prognostic utility of GDF15 for predicting 2-year MACE in PAD patients using explainable statistical and machine learning approaches. We conducted a prospective analysis of 1192 individuals (454 with PAD and 738 without PAD). At study entry, patient plasma GDF15 concentrations were measured using a validated multiplex immunoassay. The cohort was followed for two years to monitor the occurrence of MACE, defined as stroke, myocardial infarction, or death. Baseline GDF15 levels were compared between PAD and non-PAD participants using the Mann-Whitney U test. A machine learning model based on extreme gradient boosting (XGBoost) was trained to predict 2-year MACE using 10-fold cross-validation, incorporating GDF15 and clinical variables including age, sex, comorbidities (hypertension, diabetes, dyslipidemia, congestive heart failure, coronary artery disease, and previous stroke or transient ischemic attack), smoking history, and cardioprotective medication use. The model's primary evaluation metric was the F1 score, a validated measurement of the harmonic mean of the precision and recall values of the prediction model. Secondary model performance metrics included precision, recall, positive likelihood ratio (LR+), and negative likelihood ratio (LR-). A prediction probability histogram and Shapley additive explanations (SHAP) analysis were used to assess model discrimination and interpretability. The mean participant age was 70 ± SD 11 years, with 32% ( = 386) female representation. Median plasma GDF15 levels were significantly higher in PAD patients compared to the levels in non-PAD patients (1.29 [IQR 0.77-2.22] vs. 0.99 [IQR 0.61-1.63] pg/mL; < 0.001). During the 2-year follow-up period, 219 individuals (18.4%) experienced MACE. The XGBoost model demonstrated strong predictive performance for 2-year MACE (F1 score = 0.83; precision = 82.0%; recall = 83.7%; LR+ = 1.88; LR- = 0.83). The prediction histogram revealed distinct stratification between those who did vs. did not experience 2-year MACE. SHAP analysis identified GDF15 as the most influential predictive feature, surpassing traditional clinical predictors such as age, cardiovascular history, and smoking status. This study highlights GDF15 as a strong prognostic biomarker for 2-year MACE in patients with PAD. When combined with clinical variables in an interpretable machine learning model, GDF15 supports the early identification of patients at high risk for systemic cardiovascular events, facilitating personalized treatment strategies including multidisciplinary specialist referrals and aggressive cardiovascular risk reduction therapy. This biomarker-guided approach offers a promising pathway for improving cardiovascular outcomes in the PAD population through precision risk stratification.
外周动脉疾病(PAD)与主要不良心血管事件(MACE)风险升高相关。尽管如此,用于识别MACE风险升高患者的可靠生物标志物却很少。生长分化因子15(GDF15)是一种与炎症、动脉粥样硬化和血栓形成有关的应激反应细胞因子,已在心血管疾病中得到广泛研究,但在PAD中仍未得到充分探索。本研究旨在使用可解释的统计和机器学习方法评估GDF15对预测PAD患者2年MACE的预后效用。我们对1192名个体(454名患有PAD和738名未患有PAD)进行了前瞻性分析。在研究开始时,使用经过验证的多重免疫测定法测量患者血浆GDF15浓度。对该队列进行了两年的随访,以监测MACE的发生情况,MACE定义为中风、心肌梗死或死亡。使用Mann-Whitney U检验比较PAD和非PAD参与者的基线GDF15水平。基于极端梯度提升(XGBoost)训练了一个机器学习模型以使用10折交叉验证预测2年MACE,纳入了GDF15和临床变量,包括年龄、性别、合并症(高血压、糖尿病、血脂异常、充血性心力衰竭、冠状动脉疾病以及既往中风或短暂性脑缺血发作)、吸烟史和心脏保护药物使用情况。该模型的主要评估指标是F1分数,这是对预测模型的精确率和召回率值的调和平均数的有效测量。次要模型性能指标包括精确率、召回率、阳性似然比(LR+)和阴性似然比(LR-)。使用预测概率直方图和Shapley相加解释(SHAP)分析来评估模型的辨别力和可解释性。参与者的平均年龄为70±标准差11岁,女性占32%(n = 386)。与非PAD患者相比,PAD患者的血浆GDF15水平中位数显著更高(1.29 [四分位间距0.77 - 2.22] vs. 0.99 [四分位间距0.61 - 1.63] pg/mL;P < 0.001)。在2年随访期间,219名个体(18.4%)发生了MACE。XGBoost模型对2年MACE表现出强大的预测性能(F1分数 = 0.八三;精确率 = 82.0%;召回率 = 83.7%;LR+ = 1.88;LR- = 0.83)。预测直方图显示了经历与未经历2年MACE者之间的明显分层。SHAP分析确定GDF15是最具影响力的预测特征,超过了年龄、心血管病史和吸烟状况等传统临床预测因素。本研究强调GDF15是PAD患者2年MACE的强大预后生物标志物。当与临床变量结合在一个可解释的机器学习模型中时,GDF15有助于早期识别有全身性心血管事件高风险的患者,促进包括多学科专家转诊和积极的心血管风险降低治疗在内的个性化治疗策略。这种生物标志物引导的方法为通过精确风险分层改善PAD人群的心血管结局提供了一条有前景的途径。