Duan Furui, Zhang Minghui, Yang Chunyan, Wang Xuewei, Wang Dalong
PET/CT Department, The Second Affiliated Hospital of Harbin Medical University, 246 Xuefu Road, Nangang District, Harbin, Heilongjiang, PR China.
Department of Respiratory Medicine, Affiliated Cancer Hospital of Harbin Medical University, 150 Haping Road, Nangang District, Harbin, Heilongjiang Province, PR China.
Acad Radiol. 2025 Mar;32(3):1645-1655. doi: 10.1016/j.acra.2024.11.037. Epub 2024 Dec 10.
This study aimed to develop and evaluate a machine learning model combining clinical, radiomics, and deep learning features derived from PET/CT imaging to predict lymph node metastasis (LNM) in patients with non-small cell lung cancer (NSCLC). The model's interpretability was enhanced using Shapley additive explanations (SHAP).
A total of 248 NSCLC patients who underwent preoperative PET/CT scans were included and divided into training, test, and external validation sets. Radiomics features were extracted from segmented tumor regions on PET/CT images, and deep learning features were generated using the ResNet50 architecture. Feature selection was performed using minimum-redundancy maximum-relevance (mRMR), and the least absolute shrinkage and selection operator (LASSO) algorithm. Four models-clinical, radiomics, deep learning radiomics (DL_radiomics), and combined model-were constructed using the XGBoost algorithm and evaluated based on diagnostic performance metrics, including area under the receiver operating characteristic curve (AUC), accuracy, F1 score, sensitivity, and specificity. Shapley Additive exPlanations (SHAP) was used for model interpretability.
The combined model achieved the highest AUC in the test set (AUC=0.853), outperforming the clinical (AUC=0.758), radiomics (AUC=0.831), and DL_radiomics (AUC=0.834) models. Decision curve analysis (DCA) demonstrated that the combined model offered greater clinical net benefits. SHAP was used for global interpretation, and the summary plot indicated that the features ct_original_glrlm_LongRunHighGrayLevelEmphasis, and pet_gradient_glcm_lmc1 were the most important for the model's predictions.
The combined model, combining clinical, radiomics, and deep learning features from PET/CT, significantly improved the accuracy of LNM prediction in NSCLC patients. SHAP-based interpretability provided valuable insights into the model's decision-making process, enhancing its potential clinical application for preoperative decision-making in NSCLC.
本研究旨在开发并评估一种机器学习模型,该模型结合了从PET/CT成像中提取的临床、影像组学和深度学习特征,以预测非小细胞肺癌(NSCLC)患者的淋巴结转移(LNM)。使用Shapley加法解释(SHAP)增强了模型的可解释性。
共纳入248例接受术前PET/CT扫描的NSCLC患者,并将其分为训练集、测试集和外部验证集。从PET/CT图像上分割出的肿瘤区域中提取影像组学特征,并使用ResNet50架构生成深度学习特征。使用最小冗余最大相关(mRMR)和最小绝对收缩与选择算子(LASSO)算法进行特征选择。使用XGBoost算法构建了四个模型——临床模型、影像组学模型、深度学习影像组学(DL_影像组学)模型和联合模型,并基于诊断性能指标进行评估,包括受试者操作特征曲线下面积(AUC)、准确率、F1分数、敏感性和特异性。使用Shapley加法解释(SHAP)来实现模型的可解释性。
联合模型在测试集中获得了最高的AUC(AUC = 0.853),优于临床模型(AUC = 0.758)、影像组学模型(AUC = 0.831)和DL_影像组学模型(AUC = 0.834)。决策曲线分析(DCA)表明联合模型具有更大的临床净效益。使用SHAP进行全局解释,汇总图表明特征ct_original_glrlm_LongRunHighGrayLevelEmphasis和pet_gradient_glcm_lmc1对模型预测最为重要。
结合PET/CT的临床、影像组学和深度学习特征的联合模型显著提高了NSCLC患者LNM预测的准确性。基于SHAP的可解释性为模型的决策过程提供了有价值的见解,增强了其在NSCLC术前决策中的潜在临床应用价值。