Demircioğlu Aydin
Institute of Diagnostic and Interventional Radiology and Neuroradiology, University Hospital Essen, Hufelandstraße 55, 45147 Essen, Germany.
Diagnostics (Basel). 2023 Oct 20;13(20):3266. doi: 10.3390/diagnostics13203266.
In radiomics, utilizing features extracted from pretrained deep networks could result in models with a higher predictive performance than those relying on hand-crafted features. This study compared the predictive performance of models trained with either deep features, hand-crafted features, or a combination of these features in terms of the area under the receiver-operating characteristic curve (AUC) and other metrics. We trained models on ten radiological datasets using five feature selection methods and three classifiers. Our results indicate that models based on deep features did not show an improved AUC compared to those utilizing hand-crafted features (deep: AUC 0.775, hand-crafted: AUC 0.789; = 0.28). Including morphological features alongside deep features led to overall improvements in prediction performance for all models (+0.02 gain in AUC; < 0.001); however, the best model did not benefit from this (+0.003 gain in AUC; = 0.57). Using all hand-crafted features in addition to the deep features resulted in a further overall improvement (+0.034 in AUC; < 0.001), but only a minor improvement could be observed for the best model (deep: AUC 0.798, hand-crafted: AUC 0.789; = 0.92). Furthermore, our results show that models based on deep features extracted from networks pretrained on medical data have no advantage in predictive performance over models relying on features extracted from networks pretrained on ImageNet data. Our study contributes a benchmarking analysis of models trained on hand-crafted and deep features from pretrained networks across multiple datasets. It also provides a comprehensive understanding of their applicability and limitations in radiomics. Our study shows, in conclusion, that models based on features extracted from pretrained deep networks do not outperform models trained on hand-crafted ones.
在放射组学中,利用从预训练深度网络提取的特征构建的模型,其预测性能可能高于那些依赖手工特征的模型。本研究根据受试者工作特征曲线下面积(AUC)及其他指标,比较了使用深度特征、手工特征或二者组合训练的模型的预测性能。我们使用五种特征选择方法和三种分类器,在十个放射学数据集上训练模型。我们的结果表明,与使用手工特征的模型相比,基于深度特征的模型在AUC上并未表现出改善(深度特征:AUC 0.775,手工特征:AUC 0.789;P = 0.28)。将形态学特征与深度特征相结合,可使所有模型的预测性能得到整体提升(AUC提升0.02;P < 0.001);然而,最佳模型并未从中受益(AUC提升0.003;P = 0.57)。除深度特征外,使用所有手工特征可进一步实现整体提升(AUC提升0.034;P < 0.001),但对于最佳模型,仅观察到微小提升(深度特征:AUC 0.798,手工特征:AUC 0.789;P = 0.92)。此外,我们的结果表明,基于从医学数据预训练网络提取的深度特征构建的模型,在预测性能方面并不优于依赖从ImageNet数据预训练网络提取的特征构建的模型。我们的研究对在多个数据集上使用手工特征和预训练网络的深度特征训练的模型进行了基准分析。它还全面了解了它们在放射组学中的适用性和局限性。总之,我们的研究表明,基于从预训练深度网络提取的特征构建的模型,其性能并不优于基于手工特征训练的模型。