Cai Hao, Shao Yue, Liu Xuan-Yu, Li Chang-Ying, Ran Hao-Yu, Shi Hao-Ming, Zhang Cheng, Wu Qing-Chen
Department of Cardiothoracic Surgery, The First Affiliated Hospital of Chongqing Medical University, No.1, Medical College Road, Yuzhong District, Chongqing, 400016, China.
Eur J Med Res. 2025 Apr 15;30(1):277. doi: 10.1186/s40001-025-02510-w.
This study aims to develop a reliable and interpretable predictive model for long-term survival in Type A aortic dissection (TAAD) patients, utilizing machine learning (ML) algorithms.
We retrospectively reviewed the clinical data of patients diagnosed with TAAD who underwent open surgical repair at the First Affiliated Hospital of Chongqing Medical University, from September 2017 to December 2020, and at the Chongqing University Central Hospital between October 2019 and April 2020. Cases with less than 20% missing data were imputed using random forest algorithms. To identify significant prognostic factors, we performed LASSO (Least Absolute Shrinkage and Selection Operator) Cox regression analysis, including preoperative blood markers, previous medical history and intraoperative condition. Based on the advantages of the model and the characteristics of the data set, we subsequently developed a machine learning-based prognostic model using Support Vector Machine (SVM) and evaluated its performance across key metrics. To further explain the decision-making process of the SVM model, we employed SHapley Additive exPlanation (SHAP) values for model interpretation.
A total of 171 patients with TAAD were included in model training and internal test groups; 73 patients with TAAD were included in external test group. Through LASSO Cox regression, univariate analysis, and clinical relevance assessment, seven feature variables were selected for modeling. Performance evaluation revealed that the SVM model showed excellent performance in both the training and test sets, with no significant overfitting, indicating strong clinical applicability. In the training set, the model achieved an AUC of 0.9137 (95% CI 0.9081-0.9203) and in the internal and external testing set, 0.8533 (95% CI 0.8503-0.8624) and 0.8770 (95% CI 0.8698-0.8982), respectively. The accuracy values were 0.8366, 0.8481 and 0.8030; precision values were 0.8696, 0.8374 and 0.8235; recall values were 0.8421, 0.7933 and 0.7651; F1 scores were 0.8290, 0.8148 and 0.7928; Brier scores were 0.1213, 0.1417 and 0.1323; average precision (AP) values were 0.9019, 0.8789 and 0.8548, respectively. SHAP analysis revealed that longer operation time, extended cardiopulmonary bypass (CPB) duration, prolonged aortic cross-clamp (ACC) time, advanced age, higher plasma transfusion volume, elevated serum creatinine and increased white blood cell (WBC) count significantly contributed to higher model predictions.
This study developed an interpretable predictive model based on the SVM algorithm to assess long-term survival in TAAD patients. The model demonstrated accuracy, precision, and robustness in identifying high-risk patients, providing reliable evidence for clinicians.
本研究旨在利用机器学习(ML)算法,为A型主动脉夹层(TAAD)患者的长期生存建立一个可靠且可解释的预测模型。
我们回顾性分析了2017年9月至2020年12月在重庆医科大学附属第一医院以及2019年10月至2020年4月在重庆大学附属中心医院接受开放手术修复的TAAD患者的临床资料。数据缺失少于20%的病例采用随机森林算法进行插补。为了确定显著的预后因素,我们进行了LASSO(最小绝对收缩和选择算子)Cox回归分析,包括术前血液指标、既往病史和术中情况。基于模型的优势和数据集的特点,我们随后使用支持向量机(SVM)开发了一个基于机器学习的预后模型,并通过关键指标评估其性能。为了进一步解释SVM模型的决策过程,我们采用SHapley加性解释(SHAP)值进行模型解释。
共有171例TAAD患者纳入模型训练组和内部测试组;73例TAAD患者纳入外部测试组。通过LASSO Cox回归、单因素分析和临床相关性评估,选择了7个特征变量进行建模。性能评估显示,SVM模型在训练集和测试集中均表现出色,无明显过拟合,表明具有较强的临床适用性。在训练集中,模型的AUC为0.9137(95%CI 0.9081 - 0.9203),在内部和外部测试集中分别为0.8533(95%CI 0.8503 - 0.8624)和0.8770(95%CI 0.8698 - 0.8982)。准确率分别为0.8366、0.8481和0.8030;精确率分别为0.8696、0.8374和0.8235;召回率分别为0.8421、0.7933和0.7651;F1分数分别为0.8290、0.8148和0.7928;布里尔分数分别为0.1213、0.1417和0.1323;平均精度(AP)值分别为0.9019、0.8789和0.8548。SHAP分析显示,手术时间延长、体外循环(CPB)时间延长、主动脉阻断(ACC)时间延长、年龄较大、血浆输注量增加、血清肌酐升高和白细胞(WBC)计数增加显著导致模型预测值升高。
本研究基于SVM算法开发了一个可解释的预测模型,用于评估TAAD患者的长期生存。该模型在识别高危患者方面显示出准确性、精确性和稳健性,为临床医生提供了可靠的依据。