Gómez Ober Van, Herraiz Joaquin L, Udías José Manuel, Haug Alexander, Papp Laszlo, Cioni Dania, Neri Emanuele
Nuclear Physics Group and IPARCOS, Faculty of Physical Sciences, University Complutense of Madrid, CEI Moncloa, 28040 Madrid, Spain.
Academic Radiology and Master in Oncologic Imaging, Department of Translational Research, University of Pisa, Via Roma, 67, 56126 Pisa, Italy.
Cancers (Basel). 2022 Jun 14;14(12):2922. doi: 10.3390/cancers14122922.
This study aimed to identify optimal combinations between feature selection methods and machine-learning classifiers for predicting the metabolic response of individual metastatic breast cancer lesions, based on clinical variables and radiomic features extracted from pretreatment [F]F-FDG PET/CT images.
A total of 48 patients with confirmed metastatic breast cancer, who received different treatments, were included. All patients had an [F]F-FDG PET/CT scan before and after the treatment. From 228 metastatic lesions identified, 127 were categorized as responders (complete or partial metabolic response) and 101 as non-responders (stable or progressive metabolic response), by using the percentage changes in SULpeak (peak standardized uptake values normalized for body lean body mass). The lesion pool was divided into training (n = 182) and testing cohorts (n = 46); for each lesion, 101 image features from both PET and CT were extracted (202 features per lesion). These features, along with clinical and pathological information, allowed the prediction model's construction by using seven popular feature selection methods in cross-combination with another seven machine-learning (ML) classifiers. The performance of the different models was investigated with the receiver-operating characteristic curve (ROC) analysis, using the area under the curve (AUC) and accuracy (ACC) metrics.
The combinations, least absolute shrinkage and selection operator (Lasso) + support vector machines (SVM), or random forest (RF) had the highest AUC in the cross-validation, with 0.93 ± 0.06 and 0.92 ± 0.03, respectively, whereas Lasso + neural network (NN) or SVM, and mutual information (MI) + RF, had the higher AUC and ACC in the validation cohort, with 0.90/0.72, 0.86/0.76, and 87/85, respectively. On average, the models with Lasso and models with SVM had the best mean performance for both AUC and ACC in both training and validation cohorts.
Image features obtained from a pretreatment [F]F-FDG PET/CT along with clinical vaiables could predict the metabolic response of metastatic breast cancer lesions, by their incorporation into predictive models, whose performance depends on the selected combination between feature selection and ML classifier methods.
本研究旨在基于从治疗前的[F]F-FDG PET/CT图像中提取的临床变量和影像组学特征,确定特征选择方法与机器学习分类器之间的最佳组合,以预测个体转移性乳腺癌病灶的代谢反应。
共纳入48例确诊的转移性乳腺癌患者,这些患者接受了不同的治疗。所有患者在治疗前后均进行了[F]F-FDG PET/CT扫描。通过使用SULpeak(根据去脂体重标准化的峰值标准化摄取值)的百分比变化,从228个确定的转移性病灶中,将127个分类为反应者(完全或部分代谢反应),101个分类为无反应者(稳定或进展性代谢反应)。病灶库分为训练组(n = 182)和测试组(n = 46);对于每个病灶,从PET和CT中提取101个图像特征(每个病灶202个特征)。这些特征与临床和病理信息一起,通过使用七种流行的特征选择方法与另外七种机器学习(ML)分类器的交叉组合来构建预测模型。使用曲线下面积(AUC)和准确性(ACC)指标,通过受试者操作特征曲线(ROC)分析来研究不同模型性能。
在交叉验证中,最小绝对收缩和选择算子(Lasso)+支持向量机(SVM)或随机森林(RF)的组合具有最高的AUC,分别为0.93±0.06和0.92±0.03,而在验证队列中,Lasso+神经网络(NN)或SVM以及互信息(MI)+RF具有较高的AUC和ACC,分别为0.90/0.72、0.86/0.76和0.87/0.85。平均而言,在训练和验证队列中,具有Lasso的模型和具有SVM的模型在AUC和ACC方面均具有最佳的平均性能。
从治疗前的[F]F-FDG PET/CT获得的图像特征以及临床变量,通过将其纳入预测模型,可以预测转移性乳腺癌病灶的代谢反应,其性能取决于特征选择和ML分类器方法之间的选定组合。