IEEE J Biomed Health Inform. 2022 Oct;26(10):5025-5032. doi: 10.1109/JBHI.2022.3187765. Epub 2022 Oct 4.
Breast cancer is the most common female cancer in the world, and it poses a huge threat to women's health. There is currently promising research concerning its early diagnosis using deep learning methodologies. However, some commonly used Convolutional Neural Network (CNN) and their variations, such as AlexNet, VGGNet, GoogleNet and so on, are prone to overfitting in breast cancer classification, due to both small-scale breast pathology image datasets and overconfident softmax-cross-entropy loss. To alleviate the overfitting issue for better classification accuracy, we propose a novel framework for breast pathology classification, called the AlexNet-BC model. The model is pre-trained using the ImageNet dataset and fine-tuned using an augmented dataset. We also devise an improved cross-entropy loss function to penalize overconfident low-entropy output distributions and make the predictions suitable for uniform distributions. The proposed approach is then validated through a series of comparative experiments on BreaKHis, IDC and UCSB datasets. The experimental results show that the proposed method outperforms the state-of-the-art methods at different magnifications. Its strong robustness and generalization capabilities make it suitable for histopathology clinical computer-aided diagnosis systems.
乳腺癌是全球最常见的女性癌症,对女性健康构成巨大威胁。目前,使用深度学习方法进行早期诊断的研究前景广阔。然而,一些常用的卷积神经网络(CNN)及其变体,如 AlexNet、VGGNet、GoogleNet 等,在乳腺癌分类中容易出现过拟合,这是由于小规模的乳腺病理图像数据集和过度自信的 softmax 交叉熵损失。为了缓解过拟合问题以提高分类准确性,我们提出了一种用于乳腺病理分类的新框架,称为 AlexNet-BC 模型。该模型使用 ImageNet 数据集进行预训练,并使用扩充数据集进行微调。我们还设计了一种改进的交叉熵损失函数,以惩罚过度自信的低熵输出分布,并使预测适合均匀分布。然后,我们在 BreaKHis、IDC 和 UCSB 数据集上进行了一系列对比实验来验证该方法。实验结果表明,该方法在不同放大倍数下均优于最新方法。其强大的鲁棒性和泛化能力使其适用于组织病理学临床计算机辅助诊断系统。