Kang Yanru, Li Mei, Xing Xizi, Qian Kaixuan, Liu Hongxia, Qi Yafei, Liu Yanguo, Cui Yi, Zhang Hua
School of Clinical and Basic Medicine, Shandong First Medical University, Jinan, 250117, China.
Department of Radiology, The Affiliated of Shandong Traditional Medical University, Jinan, 250011, China.
BMC Med Imaging. 2025 Jun 4;25(1):202. doi: 10.1186/s12880-025-01686-1.
This study aimed to develop and validate machine learning models for preoperative identification of metastasis to station 4 mediastinal lymph nodes (MLNM) in non-small cell lung cancer (NSCLC) patients at pathological N0-N2 (pN0-pN2) stage, thereby enhancing the precision of clinical decision-making.
We included a total of 356 NSCLC patients at pN0-pN2 stage, divided into training (n = 207), internal test (n = 90), and independent test (n = 59) sets. Station 4 mediastinal lymph nodes (LNs) regions of interest (ROIs) were semi-automatically segmented on venous-phase computed tomography (CT) images for radiomics feature extraction. Using least absolute shrinkage and selection operator (LASSO) regression to select features with non-zero coefficients. Four machine learning algorithms-decision tree (DT), logistic regression (LR), random forest (RF), and support vector machine (SVM)-were employed to construct radiomics models. Clinical predictors were identified through univariate and multivariate logistic regression, which were subsequently integrated with radiomics features to develop combined models. Models performance were evaluated using receiver operating characteristic (ROC) analysis, calibration curves, decision curve analysis (DCA), and DeLong's test.
Out of 1721 radiomics features, eight radiomics features were selected using LASSO regression. The RF-based combined model exhibited the strongest discriminative power, with an area under the curve (AUC) of 0.934 for the training set and 0.889 for the internal test set. The calibration curve and DCA further indicated the superior performance of the combined model based on RF. The independent test set further verified the model's robustness.
The combined model based on RF, integrating radiomics and clinical features, effectively and non-invasively identifies metastasis to the station 4 mediastinal LNs in NSCLC patients at pN0-pN2 stage. This model serves as an effective auxiliary tool for clinical decision-making and has the potential to optimize treatment strategies and improve prognostic assessment for pN0-pN2 patients.
Not applicable.
本研究旨在开发并验证机器学习模型,用于术前识别处于病理N0-N2(pN0-pN2)期的非小细胞肺癌(NSCLC)患者纵隔4区淋巴结转移(MLNM),从而提高临床决策的准确性。
我们纳入了总共356例pN0-pN2期的NSCLC患者,分为训练集(n = 207)、内部测试集(n = 90)和独立测试集(n = 59)。在静脉期计算机断层扫描(CT)图像上对纵隔4区淋巴结(LNs)感兴趣区域(ROIs)进行半自动分割,以提取影像组学特征。使用最小绝对收缩和选择算子(LASSO)回归来选择非零系数的特征。采用四种机器学习算法——决策树(DT)、逻辑回归(LR)、随机森林(RF)和支持向量机(SVM)——构建影像组学模型。通过单变量和多变量逻辑回归确定临床预测指标,随后将其与影像组学特征整合以开发联合模型。使用受试者工作特征(ROC)分析、校准曲线、决策曲线分析(DCA)和德龙检验来评估模型性能。
在1721个影像组学特征中,使用LASSO回归选择了8个影像组学特征。基于RF的联合模型表现出最强的判别能力,训练集的曲线下面积(AUC)为0.934,内部测试集为0.889。校准曲线和DCA进一步表明基于RF的联合模型具有优越性能。独立测试集进一步验证了模型的稳健性。
基于RF的联合模型整合了影像组学和临床特征,能够有效且无创地识别pN0-pN2期NSCLC患者纵隔4区LNs转移。该模型是临床决策的有效辅助工具,有可能优化治疗策略并改善pN0-pN2患者的预后评估。
不适用。