Albashish Dheeb
Computer Science Department/ Prince Abdullah bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Alsalt, Jordan.
PeerJ Comput Sci. 2022 Jul 5;8:e1031. doi: 10.7717/peerj-cs.1031. eCollection 2022.
Deep convolutional neural networks (CNN) manifest the potential for computer-aided diagnosis systems (CADs) by learning features directly from images rather than using traditional feature extraction methods. Nevertheless, due to the limited sample sizes and heterogeneity in tumor presentation in medical images, CNN models suffer from training issues, including training from scratch, which leads to overfitting. Alternatively, a pre-trained neural network's transfer learning (TL) is used to derive tumor knowledge from medical image datasets using CNN that were designed for non-medical activations, alleviating the need for large datasets. This study proposes two ensemble learning techniques: E-CNN (product rule) and E-CNN (majority voting). These techniques are based on the adaptation of the pretrained CNN models to classify colon cancer histopathology images into various classes. In these ensembles, the individuals are, initially, constructed by adapting pretrained DenseNet121, MobileNetV2, InceptionV3, and VGG16 models. The adaptation of these models is based on a block-wise fine-tuning policy, in which a set of dense and dropout layers of these pretrained models is joined to explore the variation in the histology images. Then, the models' decisions are fused via product rule and majority voting aggregation methods. The proposed model was validated against the standard pretrained models and the most recent works on two publicly available benchmark colon histopathological image datasets: Stoean (357 images) and Kather colorectal histology (5,000 images). The results were 97.20% and 91.28% accurate, respectively. The achieved results outperformed the state-of-the-art studies and confirmed that the proposed E-CNNs could be extended to be used in various medical image applications.
深度卷积神经网络(CNN)通过直接从图像中学习特征而非使用传统特征提取方法,展现出在计算机辅助诊断系统(CAD)中的潜力。然而,由于医学图像中肿瘤表现的样本量有限且存在异质性,CNN模型存在训练问题,包括从头开始训练导致的过拟合。另外,预训练神经网络的迁移学习(TL)被用于使用为非医学激活设计的CNN从医学图像数据集中获取肿瘤知识,从而减少了对大数据集的需求。本研究提出了两种集成学习技术:E-CNN(乘积规则)和E-CNN(多数投票)。这些技术基于对预训练CNN模型的调整,以将结肠癌组织病理学图像分类为不同类别。在这些集成中,个体最初通过调整预训练的DenseNet121、MobileNetV2、InceptionV3和VGG16模型构建。这些模型的调整基于逐块微调策略,其中将这些预训练模型的一组密集层和随机失活层连接起来,以探索组织学图像中的变化。然后,通过乘积规则和多数投票聚合方法融合模型的决策。所提出的模型在两个公开可用的基准结肠组织病理学图像数据集:Stoean(357张图像)和Kather结肠组织学(5000张图像)上,与标准预训练模型和最新研究成果进行了验证。结果的准确率分别为97.20%和91.28%。所取得的结果优于现有研究,并证实所提出的E-CNN可扩展用于各种医学图像应用。