Department of Pulmonary and Critical Care Medicine, Third Affiliated Hospital of Naval Medical University, Shanghai, China.
Medicine (Baltimore). 2024 Aug 23;103(34):e39260. doi: 10.1097/MD.0000000000039260.
Postoperative pulmonary complications (PPCs) are a significant concern following lung resection due to prolonged hospital stays and increased morbidity and mortality among patients. This study aims to develop and validate a risk prediction model for PPCs after lung resection using the random forest (RF) algorithm to enhance early detection and intervention. Data from 180 patients who underwent lung resections at the Third Affiliated Hospital of the Naval Medical University between September 2022 and February 2024 were retrospectively analyzed. The patients were randomly allocated into a training set and a test set in an 8:2 ratio. An RF model was constructed using Python, with feature importance ranked based on the mean Gini index. The predictive performance of the model was evaluated through analyses of the receiver operating characteristic curve, calibration curve, and decision curve. Among the 180 patients included, 47 (26.1%) developed PPCs. The top 5 predictive factors identified by the RF model were blood loss, maximal length of resection, number of lymph nodes removed, forced expiratory volume in the first second as a percentage of predicted value, and age. The receiver operating characteristic curve and calibration curve analyses demonstrated favorable discrimination and calibration capabilities of the model, while decision curve analysis indicated its clinical applicability. The RF algorithm is effective in predicting PPCs following lung resection and holds promise for clinical application.
术后肺部并发症(PPCs)是肺切除术后的一个重要关注点,因为它会导致患者住院时间延长、发病率和死亡率增加。本研究旨在使用随机森林(RF)算法开发和验证一种用于肺切除术后 PPCs 的风险预测模型,以增强早期检测和干预的能力。对 2022 年 9 月至 2024 年 2 月在海军军医大学第三附属医院接受肺切除术的 180 例患者进行回顾性分析。将患者以 8:2 的比例随机分配到训练集和测试集中。使用 Python 构建 RF 模型,根据平均基尼指数对特征重要性进行排序。通过接受者操作特征曲线、校准曲线和决策曲线分析评估模型的预测性能。在纳入的 180 例患者中,有 47 例(26.1%)发生 PPCs。RF 模型确定的前 5 个预测因素是出血量、最大切除长度、切除的淋巴结数量、预计值的第一秒用力呼气量百分比和年龄。接收者操作特征曲线和校准曲线分析表明该模型具有良好的区分和校准能力,而决策曲线分析表明其具有临床适用性。RF 算法在预测肺切除术后 PPCs 方面非常有效,具有临床应用的潜力。