Institute of Medical Science and Technology, Shahid Beheshti University, Tehran, Iran.
J Digit Imaging. 2023 Dec;36(6):2602-2612. doi: 10.1007/s10278-023-00887-w. Epub 2023 Aug 2.
Breast cancer is the second most common cancer among women worldwide, and the diagnosis by pathologists is a time-consuming procedure and subjective. Computer-aided diagnosis frameworks are utilized to relieve pathologist workload by classifying the data automatically, in which deep convolutional neural networks (CNNs) are effective solutions. The features extracted from the activation layer of pre-trained CNNs are called deep convolutional activation features (DeCAF). In this paper, we have analyzed that all DeCAF features are not necessarily led to higher accuracy in the classification task and dimension reduction plays an important role. We have proposed reduced DeCAF (R-DeCAF) for this purpose, and different dimension reduction methods are applied to achieve an effective combination of features by capturing the essence of DeCAF features. This framework uses pre-trained CNNs such as AlexNet, VGG-16, and VGG-19 as feature extractors in transfer learning mode. The DeCAF features are extracted from the first fully connected layer of the mentioned CNNs, and a support vector machine is used for classification. Among linear and nonlinear dimensionality reduction algorithms, linear approaches such as principal component analysis (PCA) represent a better combination among deep features and lead to higher accuracy in the classification task using a small number of features considering a specific amount of cumulative explained variance (CEV) of features. The proposed method is validated using experimental BreakHis and ICIAR datasets. Comprehensive results show improvement in the classification accuracy up to 4.3% with a feature vector size (FVS) of 23 and CEV equal to 0.15.
乳腺癌是全球女性中第二常见的癌症,病理学家的诊断是一个耗时且主观的过程。计算机辅助诊断框架可通过自动分类数据来减轻病理学家的工作量,其中深度卷积神经网络(CNN)是有效的解决方案。从预训练 CNN 的激活层中提取的特征称为深度卷积激活特征(DeCAF)。在本文中,我们分析了并非所有 DeCAF 特征都一定能提高分类任务的准确性,降维和特征选择在其中起着重要作用。为此,我们提出了简化 DeCAF(R-DeCAF),应用不同的降维方法通过捕捉 DeCAF 特征的本质来实现特征的有效组合。该框架使用预训练的 CNN(如 AlexNet、VGG-16 和 VGG-19)作为特征提取器,在迁移学习模式下工作。从上述 CNN 的第一个全连接层中提取 DeCAF 特征,并使用支持向量机进行分类。在线性和非线性降维算法中,线性方法(如主成分分析(PCA))在深特征之间表现出更好的组合,并在考虑特定特征累积解释方差(CEV)的情况下,使用少量特征实现更高的分类任务准确性。该方法使用实验性的 BreakHis 和 ICIAR 数据集进行验证。综合结果表明,在特征向量大小(FVS)为 23 和 CEV 等于 0.15 的情况下,分类精度提高了 4.3%。