Attallah Omneya, Anwar Fatma, Ghanem Nagia M, Ismail Mohamed A
Department of Electronics and Communications Engineering, College of Engineering and Technology, Arab Academy for Science, Technology, and Maritime Transport, Alexandria, Alexandria, Egypt.
Computer and Systems Engineering Department, Alexandria University, Alexandria, Egypt.
PeerJ Comput Sci. 2021 Apr 27;7:e493. doi: 10.7717/peerj-cs.493. eCollection 2021.
Breast cancer (BC) is one of the most common types of cancer that affects females worldwide. It may lead to irreversible complications and even death due to late diagnosis and treatment. The pathological analysis is considered the gold standard for BC detection, but it is a challenging task. Automatic diagnosis of BC could reduce death rates, by creating a computer aided diagnosis (CADx) system capable of accurately identifying BC at an early stage and decreasing the time consumed by pathologists during examinations. This paper proposes a novel CADx system named Histo-CADx for the automatic diagnosis of BC. Most related studies were based on individual deep learning methods. Also, studies did not examine the influence of fusing features from multiple CNNs and handcrafted features. In addition, related studies did not investigate the best combination of fused features that influence the performance of the CADx. Therefore, Histo-CADx is based on two stages of fusion. The first fusion stage involves the investigation of the impact of fusing several deep learning (DL) techniques with handcrafted feature extraction methods using the auto-encoder DL method. This stage also examines and searches for a suitable set of fused features that could improve the performance of Histo-CADx. The second fusion stage constructs a multiple classifier system (MCS) for fusing outputs from three classifiers, to further improve the accuracy of the proposed Histo-CADx. The performance of Histo-CADx is evaluated using two public datasets; specifically, the BreakHis and the ICIAR 2018 datasets. The results from the analysis of both datasets verified that the two fusion stages of Histo-CADx successfully improved the accuracy of the CADx compared to CADx constructed with individual features. Furthermore, using the auto-encoder for the fusion process has reduced the computation cost of the system. Moreover, the results after the two fusion stages confirmed that Histo-CADx is reliable and has the capacity of classifying BC more accurately compared to other latest studies. Consequently, it can be used by pathologists to help them in the accurate diagnosis of BC. In addition, it can decrease the time and effort needed by medical experts during the examination.
乳腺癌(BC)是全球影响女性的最常见癌症类型之一。由于诊断和治疗延迟,它可能导致不可逆转的并发症甚至死亡。病理分析被认为是乳腺癌检测的金标准,但这是一项具有挑战性的任务。通过创建一个能够在早期准确识别乳腺癌并减少病理学家检查时间的计算机辅助诊断(CADx)系统,乳腺癌的自动诊断可以降低死亡率。本文提出了一种名为Histo-CADx的新型CADx系统用于乳腺癌的自动诊断。大多数相关研究基于单个深度学习方法。此外,研究没有考察融合来自多个卷积神经网络(CNN)的特征和手工特征的影响。另外,相关研究没有探究影响CADx性能的融合特征的最佳组合。因此,Histo-CADx基于两个融合阶段。第一个融合阶段涉及使用自动编码器深度学习方法研究融合几种深度学习(DL)技术与手工特征提取方法的影响。此阶段还检查并寻找一组合适的融合特征,以提高Histo-CADx的性能。第二个融合阶段构建一个多分类器系统(MCS)来融合三个分类器的输出,以进一步提高所提出的Histo-CADx的准确性。使用两个公共数据集评估Histo-CADx的性能;具体而言,是BreakHis和ICIAR 2018数据集。对两个数据集的分析结果证实,与使用单个特征构建的CADx相比,Histo-CADx的两个融合阶段成功提高了CADx的准确性。此外,在融合过程中使用自动编码器降低了系统的计算成本。而且,两个融合阶段后的结果证实,与其他最新研究相比,Histo-CADx是可靠的,并且具有更准确地对乳腺癌进行分类的能力。因此,病理学家可以使用它来帮助他们准确诊断乳腺癌。此外,它可以减少医学专家在检查过程中所需的时间和精力。