比较不同卷积神经网络激活函数以及针对中小医疗数据集构建集成的方法。

Comparison of Different Convolutional Neural Network Activation Functions and Methods for Building Ensembles for Small to Midsize Medical Data Sets.

机构信息

Department of Information Engineering, University of Padua, Via Gradenigo 6, 35131 Padova, Italy.

Department of Information Technology and Cybersecurity, Missouri State University, 901 S. National Street, Springfield, MO 65804, USA.

出版信息

Sensors (Basel). 2022 Aug 16;22(16):6129. doi: 10.3390/s22166129.

DOI:10.3390/s22166129

PMID:36015898

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9415767/

Abstract

CNNs and other deep learners are now state-of-the-art in medical imaging research. However, the small sample size of many medical data sets dampens performance and results in overfitting. In some medical areas, it is simply too labor-intensive and expensive to amass images numbering in the hundreds of thousands. Building Deep CNN ensembles of pre-trained CNNs is one powerful method for overcoming this problem. Ensembles combine the outputs of multiple classifiers to improve performance. This method relies on the introduction of diversity, which can be introduced on many levels in the classification workflow. A recent ensembling method that has shown promise is to vary the activation functions in a set of CNNs or within different layers of a single CNN. This study aims to examine the performance of both methods using a large set of twenty activations functions, six of which are presented here for the first time: 2D Mexican ReLU, TanELU, MeLU + GaLU, Symmetric MeLU, Symmetric GaLU, and Flexible MeLU. The proposed method was tested on fifteen medical data sets representing various classification tasks. The best performing ensemble combined two well-known CNNs (VGG16 and ResNet50) whose standard ReLU activation layers were randomly replaced with another. Results demonstrate the superiority in performance of this approach.

摘要

CNN 及其它深度学习模型在医学影像研究中已经是最先进的了。然而，许多医学数据集的样本量较小，影响了模型的性能并导致过拟合。在一些医学领域，要积累数以十万计的图像，既费时又费钱。构建由预训练 CNN 组成的深度 CNN 集成是克服这一问题的一种强大方法。集成模型将多个分类器的输出结合起来以提高性能。该方法依赖于在分类工作流程的多个层面引入多样性。最近，一种很有前景的集成方法是在一组 CNN 中或在单个 CNN 的不同层中改变激活函数。本研究旨在使用一大组二十种激活函数来检验这两种方法的性能，其中六种是首次提出的：二维墨西哥 ReLU、TanELU、MeLU + GaLU、对称 MeLU、对称 GaLU 和灵活 MeLU。所提出的方法在十五个医学数据集上进行了测试，这些数据集代表了各种分类任务。表现最好的集成模型组合了两个著名的 CNN（VGG16 和 ResNet50），它们的标准 ReLU 激活层被随机替换为另一种。结果表明，这种方法具有优越性。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

比较不同卷积神经网络激活函数以及针对中小医疗数据集构建集成的方法。

Comparison of Different Convolutional Neural Network Activation Functions and Methods for Building Ensembles for Small to Midsize Medical Data Sets.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

比较不同卷积神经网络激活函数以及针对中小医疗数据集构建集成的方法。

Comparison of Different Convolutional Neural Network Activation Functions and Methods for Building Ensembles for Small to Midsize Medical Data Sets.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献