Department of Information Engineering, University of Padua, Via Gradenigo 6, 35131 Padova, Italy.
Department of Information Technology and Cybersecurity, Missouri State University, 901 S. National Street, Springfield, MO 65804, USA.
Sensors (Basel). 2022 Aug 16;22(16):6129. doi: 10.3390/s22166129.
CNNs and other deep learners are now state-of-the-art in medical imaging research. However, the small sample size of many medical data sets dampens performance and results in overfitting. In some medical areas, it is simply too labor-intensive and expensive to amass images numbering in the hundreds of thousands. Building Deep CNN ensembles of pre-trained CNNs is one powerful method for overcoming this problem. Ensembles combine the outputs of multiple classifiers to improve performance. This method relies on the introduction of diversity, which can be introduced on many levels in the classification workflow. A recent ensembling method that has shown promise is to vary the activation functions in a set of CNNs or within different layers of a single CNN. This study aims to examine the performance of both methods using a large set of twenty activations functions, six of which are presented here for the first time: 2D Mexican ReLU, TanELU, MeLU + GaLU, Symmetric MeLU, Symmetric GaLU, and Flexible MeLU. The proposed method was tested on fifteen medical data sets representing various classification tasks. The best performing ensemble combined two well-known CNNs (VGG16 and ResNet50) whose standard ReLU activation layers were randomly replaced with another. Results demonstrate the superiority in performance of this approach.
CNN 及其它深度学习模型在医学影像研究中已经是最先进的了。然而,许多医学数据集的样本量较小,影响了模型的性能并导致过拟合。在一些医学领域,要积累数以十万计的图像,既费时又费钱。构建由预训练 CNN 组成的深度 CNN 集成是克服这一问题的一种强大方法。集成模型将多个分类器的输出结合起来以提高性能。该方法依赖于在分类工作流程的多个层面引入多样性。最近,一种很有前景的集成方法是在一组 CNN 中或在单个 CNN 的不同层中改变激活函数。本研究旨在使用一大组二十种激活函数来检验这两种方法的性能,其中六种是首次提出的:二维墨西哥 ReLU、TanELU、MeLU + GaLU、对称 MeLU、对称 GaLU 和灵活 MeLU。所提出的方法在十五个医学数据集上进行了测试,这些数据集代表了各种分类任务。表现最好的集成模型组合了两个著名的 CNN(VGG16 和 ResNet50),它们的标准 ReLU 激活层被随机替换为另一种。结果表明,这种方法具有优越性。