School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, Sichuan, China.
Neural Netw. 2020 May;125:281-289. doi: 10.1016/j.neunet.2020.02.012. Epub 2020 Feb 26.
Rectified activation units make an important contribution to the success of deep neural networks in many computer vision tasks. In this paper, we propose a Parametric Deformable Exponential Linear Unit (PDELU) and theoretically verify its effectiveness for improving the convergence speed of the learning procedure. By means of its flexible shape, the proposed PDELU pushes the mean value of the activation responses closer to zero, which ensures the steepest descent when training a deep neural network. We verify the effectiveness of the proposed method on the image classification task. Extensive experiments on three classical datasets (i.e., CIFAR-10, CIFAR-100, and ImageNet-2015) indicate that the proposed method achieves faster convergence and higher accuracy when embedded into different CNN architectures (i.e., NIN, ResNet, WRN, and DenseNet). Meanwhile, the proposed PDELU outperforms many existing shape-specific activation functions (i.e., Maxout, ReLU, LeakyReLU, ELU, SELU, SoftPlus, Swish) and shape-adaptive activation functions (i.e., APL, PReLU, MPELU, FReLU).
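The abstract does not spell out the functional form of PDELU. The sketch below is a hypothetical PyTorch reconstruction assuming a deformed-exponential negative branch, a * ([1 + (1 - t)x]_+^{1/(1-t)} - 1) for x <= 0 and identity for x > 0, with a learnable per-channel scale a and a fixed deformation parameter t; the class name, parameter names, and defaults are illustrative, not the authors' reference implementation.

```python
import torch
import torch.nn as nn

class PDELU(nn.Module):
    """Hypothetical PDELU-style activation (sketch, not the paper's code).

    Identity for x > 0; for x <= 0 a deformed exponential
    a * ([1 + (1 - t) * x]_+ ** (1 / (1 - t)) - 1),
    where `a` is a learnable per-channel scale and `t` (t != 1) controls
    the deformation. As t -> 1 the negative branch approaches ELU's
    a * (exp(x) - 1).
    """

    def __init__(self, num_channels: int, t: float = 0.9, init_a: float = 1.0):
        super().__init__()
        self.t = t
        # One learnable scale per channel, as in PReLU-style parametric units.
        self.a = nn.Parameter(torch.full((num_channels,), init_a))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Reshape `a` to broadcast over batch and spatial dimensions.
        a = self.a.view(1, -1, *([1] * (x.dim() - 2)))
        # [1 + (1 - t) * x]_+ : clamp keeps the base non-negative, so the
        # fractional power is well defined for strongly negative inputs.
        base = torch.clamp(1.0 + (1.0 - self.t) * x, min=0.0)
        neg = a * (base ** (1.0 / (1.0 - self.t)) - 1.0)
        return torch.where(x > 0, x, neg)

# Usage: a drop-in replacement for nn.ReLU inside a CNN block.
act = PDELU(num_channels=64, t=0.9)
y = act(torch.randn(8, 64, 32, 32))
```

Under this reading, the deformation parameter t reshapes the saturating negative branch, which is what lets the unit pull the mean activation toward zero, the property the abstract credits for faster convergence.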