Chen Shaolin, Deng Ting, Yang Qing, Li Jin, Shen Juanyan, Luo Xu, Tang Juan, Zhang Xulian, Salvador Jordan Tovera, Ma Junliang
Nursing Department, Affiliated Hospital of Zunyi Medical University, Zunyi, Guizhou Province, China.
School of Nursing, Zunyi Medical University, Zunyi, Guizhou Province, China.
EClinicalMedicine. 2025 Aug 1;86:103386. doi: 10.1016/j.eclinm.2025.103386. eCollection 2025 Aug.
Early identification and prediction of postoperative pulmonary complications (PPCs) are vital for patient management in lung cancer (LC) surgery. However, existing predictive models often lack comprehensive validation and interpretability. This study aimed to develop and validate an explainable machine learning (ML) model to predict PPCs in patients with LC undergoing surgery.
A risk factor variable pool was determined by meta-analysis and Delphi surveys. Patients undergoing LC surgery who were admitted to the Thoracic Surgery Department at the Affiliated Hospital of Zunyi Medical University from 1st January 2022 to 31st October 2023 (retrospective) and from 1st November 2023 to 31st July 2024 (prospective) were used for model development and prospective validation, respectively. The retrospective cohort was randomly split into a training set and an internal validation set at an 8:2 ratio. Feature selection involved univariate analysis, collinearity analysis, nine ML algorithms, and expert consensus. Twelve independent ML models and 26 stacking ensemble models were developed. Predictive performance was evaluated using the area under the receiver-operating-characteristic curve (AUROC), accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and F1 score. Prospective validation was analysed using AUC, Hosmer-Lemeshow test, calibration curves, and decision curve analysis (DCA). The Shapley Additive Explanation (SHAP) method was utilised to interpret the predictive model.
A total of 883 patients were included in the retrospective cohort with an incidence of PPCs of 35.4% (313/883), and a total of 308 patients were included in the prospective cohort with PPCs of 29.5% (91/308). Nine key characteristics were selected for model development: age, duration of surgery, Charson comorbidity index (CCI), tumour stage, measured carbon monoxide diffusion (DLCO, mmol/min/kPa), intra-operative infusion volume (IFIV, mL), red blood cell volume distribution width-coefficient of variation (RDW-CV, %), body mass index (BMI), and number of years of smoking. Amongst the independent models, the Gradient Boosting Decision Tree (GBDT) showed best performance, achieving an AUROC of 0.829 (95% CI: 0.774-0.885). The stacking ensemble combining Support Vector Machine (SVM) and Decision Tree (DT) showed the highest overall performance, with an AUROC of 0.860 (95% CI: 0.809-0.911), and DCA showed higher clinical utility compared to other models. In the prospective validation, the AUROC was 0.790 (95% CI: 0.744-0.835).
The stacking ensemble model combining SVM and DT demonstrated robust predictive performance and favourable clinical utility for prediction PPCs in patients undergoing LC surgery. However, the model has not been applied in clinical practice and requires future validation in large, multi-centre cohorts. Further work should aim to identify high-risk patients early through clinical data analysis, enabling timely interventions and more efficient allocation of limited healthcare resources.
The Science and Technology Foundation of Guizhou Provincial Health Commission; the Key Talent Team of Guizhou Provincial Science and Technology Innovation; and Guizhou Science and Technology Cooperation Basic Research Project.
早期识别和预测术后肺部并发症(PPCs)对于肺癌(LC)手术患者的管理至关重要。然而,现有的预测模型往往缺乏全面的验证和可解释性。本研究旨在开发并验证一种可解释的机器学习(ML)模型,以预测接受手术的LC患者发生PPCs的情况。
通过荟萃分析和德尔菲调查确定风险因素变量池。将2022年1月1日至2023年10月31日(回顾性)以及2023年11月1日至2024年7月31日(前瞻性)期间在遵义医科大学附属医院胸外科住院接受LC手术的患者分别用于模型开发和前瞻性验证。回顾性队列以8:2的比例随机分为训练集和内部验证集。特征选择涉及单因素分析、共线性分析、九种ML算法和专家共识。开发了12个独立的ML模型和26个堆叠集成模型。使用受试者操作特征曲线下面积(AUROC)、准确性、敏感性、特异性、阳性预测值(PPV)、阴性预测值(NPV)和F1分数评估预测性能。使用AUC、Hosmer-Lemeshow检验、校准曲线和决策曲线分析(DCA)进行前瞻性验证。采用Shapley加性解释(SHAP)方法解释预测模型。
回顾性队列共纳入883例患者,PPCs发生率为3\5.4%(313/883);前瞻性队列共纳入308例患者,PPCs发生率为29.5%(91/308)。选择九个关键特征用于模型开发:年龄、手术时长、查尔森合并症指数(CCI)、肿瘤分期、一氧化碳弥散量(DLCO,mmol/min/kPa)、术中输液量(IFIV,mL)、红细胞体积分布宽度变异系数(RDW-CV,%)、体重指数(BMI)和吸烟年限。在独立模型中,梯度提升决策树(GBDT)表现最佳,AUROC为0.829(95%CI:0\774-0.885)。支持向量机(SVM)和决策树(DT)相结合的堆叠集成模型总体性能最高,AUROC为0.860(95%CI:0.809-0.911),DCA显示其临床实用性高于其他模型。在前瞻性验证中,AUROC为0.790(95%CI:0.744-0.835)。
SVM和DT相结合的堆叠集成模型在预测接受LC手术患者发生PPCs方面表现出强大的预测性能和良好的临床实用性。然而,该模型尚未应用于临床实践,需要在大型多中心队列中进行未来验证。进一步的工作应旨在通过临床数据分析早期识别高危患者,以便及时进行干预并更有效地分配有限的医疗资源。
贵州省卫生健康委员会科学技术基金;贵州省科技创新重点人才团队;以及贵州省科技合作基础研究项目。