Yao Zhiheng, Wang Yubo, Wu Yaping, Zhou Jinpeng, Dang Na, Wang Meiyun, Liang Ying, Sun Tao
Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, People's Republic of China.
Department of Nuclear Medicine, National Cancer Center, National Clinical Research Center for Cancer, Cancer Hospital & Shenzhen Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Shenzhen, People's Republic of China.
Eur J Nucl Med Mol Imaging. 2025 Apr 4. doi: 10.1007/s00259-025-07231-0.
Dynamic 18F-fluorodeoxyglucose (18F-FDG) PET/CT imaging has been shown to provide additional information for diagnosing lung cancer. The aim of this study was to investigate whether metabolic and flow features directly extracted from time activity curves (TACs) help differentiate between benign and malignant conditions of lung lesions.
TACs at the primary lesion were extracted from each dynamic 18F-FDG PET/CT scan. The TAC signal was then decomposed into metabolism and blood flow components through kinetic modeling. Dynamic features including area under the curve (AUC), time-to-peak, and slopes were then extracted from each component. The extracted features from 187 patients (mean age, 60.41 ± 11.01 years; 117 males) were used to train a classification model based on bagging, a machine-learning method built with decision trees. The performance of the trained model on differentiating benign and malignant was tested using receiver operating characteristic analysis with cross-validation. External testing was then performed for an independent dataset that consisted of 42 dynamic scans. For the results, SHapley Additive exPlanations (SHAP) were used to assess the relative importance of the contributed features for individuals. Waterfall charts were also plotted, together with assessment of Cohen's effect size to demonstrate the superiority of the proposed model over SUVmax and the net FDG influx rate K.
The combination of the multiple dynamic features was able to separate benign and malignant lesions. For cross-validation, the trained model had an AUC of 0.89, sensitivity of 0.80, and specificity of 0.88, which was significantly higher than that of either SUVmax (AUC = 0.79, DeLong p < 0.001) or K (AUC = 0.76, DeLong test p < 0.001). For the testing dataset, the model had an AUC of 0.86, which again was better than either SUVmax (AUC of 0.72) or K (AUC of 0.71). The most important features that contributed to the diagnosis identified by SHAP included the slope and maximum metabolism TAC at the lesion, the AUC, and the peak time of blood TAC at the lesion. The waterfall chart illustrated that the model had significantly different prediction scores between the benign and malignant groups (p < 0.001) with a Cohen's effect size of 1.71, which was higher than that of the values for SUV and K (Cohen's effect size 0.96 and 0.81, respectively).
An explainable machine learning model that combines dynamic FDG metabolic and flow features can predict benign or malignant lung cancer patients more accurately than conventional parameters such as SUVmax or net influx rate K.
动态18F-氟脱氧葡萄糖(18F-FDG)PET/CT成像已被证明可为肺癌诊断提供额外信息。本研究的目的是调查直接从时间-活性曲线(TAC)中提取的代谢和血流特征是否有助于区分肺病变的良性和恶性情况。
从每次动态18F-FDG PET/CT扫描中提取原发灶的TAC。然后通过动力学建模将TAC信号分解为代谢和血流成分。接着从每个成分中提取包括曲线下面积(AUC)、峰值时间和斜率等动态特征。从187例患者(平均年龄60.41±11.01岁;117例男性)提取的特征用于训练基于装袋法的分类模型,装袋法是一种基于决策树构建的机器学习方法。使用带有交叉验证的受试者工作特征分析来测试训练模型在区分良性和恶性方面的性能。然后对由42次动态扫描组成的独立数据集进行外部测试。对于结果,使用SHapley加性解释(SHAP)来评估各个特征对诊断的相对重要性。还绘制了瀑布图,并评估了科恩效应量,以证明所提出模型相对于SUVmax和净FDG流入率K的优越性。
多种动态特征的组合能够区分良性和恶性病变。对于交叉验证,训练模型的AUC为0.89,灵敏度为0.80,特异性为0.88,显著高于SUVmax(AUC = 0.79,德龙检验p < 0.001)或K(AUC = 0.76,德龙检验p < 0.001)。对于测试数据集,模型的AUC为0.86,同样优于SUVmax(AUC为0.72)或K(AUC为0.71)。SHAP确定的对诊断贡献最大的特征包括病变处的斜率和最大代谢TAC、AUC以及病变处血液TAC的峰值时间。瀑布图表明,该模型在良性和恶性组之间的预测分数有显著差异(p < 0.001),科恩效应量为1.71,高于SUV和K的值(科恩效应量分别为0.96和0.81)。
一种结合动态FDG代谢和血流特征的可解释机器学习模型比SUVmax或净流入率K等传统参数能更准确地预测肺癌患者的良性或恶性情况。