A survey on modern trainable activation functions.

Authors

Apicella Andrea, Donnarumma Francesco, Isgrò Francesco, Prevete Roberto

Affiliations

Dipartimento di Ingegneria Elettrica e delle Tecnologie dell'Informazione, Università di Napoli Federico II, Italy.

Institute of Cognitive Sciences and Technologies (ISTC), National Research Council (CNR), Via San Martino della Battaglia 44, 00185 Rome, Italy.

Publication information

Neural Netw. 2021 Jun;138:14-32. doi: 10.1016/j.neunet.2021.01.026. Epub 2021 Feb 9.

DOI: 10.1016/j.neunet.2021.01.026

PMID: 33611065

Abstract

In the neural network literature, there is strong interest in identifying and defining activation functions that can improve neural network performance. In recent years, the scientific community has shown renewed interest in activation functions that can be trained during the learning process, usually referred to as trainable, learnable, or adaptable activation functions. They appear to lead to better network performance. Diverse and heterogeneous models of trainable activation functions have been proposed in the literature. In this paper, we present a survey of these models. Starting from a discussion of the use of the term "activation function" in the literature, we propose a taxonomy of trainable activation functions, highlight common and distinctive properties of recent and past models, and discuss the main advantages and limitations of this type of approach. We show that many of the proposed approaches are equivalent to adding neuron layers that use fixed (non-trainable) activation functions together with some simple local rule that constrains the corresponding weight layers.
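The equivalence claimed in the last sentence can be illustrated with a small sketch. The example below is an assumption chosen for illustration and is not taken from the abstract: it uses the parametric ReLU (PReLU), a well-known trainable activation whose negative-side slope `a` is learned. The function names are hypothetical. The sketch shows that PReLU gives the same output as a two-unit sub-network of fixed ReLU neurons in which three of the four weights are pinned to constants and only one is trainable.

```python
def relu(x: float) -> float:
    """Fixed (non-trainable) activation function."""
    return max(0.0, x)


def prelu(x: float, a: float) -> float:
    """Parametric ReLU: the negative-side slope `a` is the trainable parameter."""
    return x if x >= 0 else a * x


def prelu_as_fixed_relu_subnet(x: float, a: float) -> float:
    """Same function, written as a tiny sub-network of fixed ReLU units.

    Illustrative assumption: the "local rule" here is that the input
    weights are pinned to +1 and -1 and the first output weight to +1,
    so only the second output weight (-a) is free to be trained.
    """
    # Hidden layer: two ReLU units with constrained input weights +1 and -1.
    h_pos = relu(1.0 * x)
    h_neg = relu(-1.0 * x)
    # Output layer: fixed weight +1 on h_pos, trainable weight -a on h_neg.
    return 1.0 * h_pos + (-a) * h_neg


if __name__ == "__main__":
    for x in (-2.0, -0.5, 0.0, 1.5):
        print(x, prelu(x, 0.25), prelu_as_fixed_relu_subnet(x, 0.25))
```

For a negative input only the second hidden unit is active, so the output is `-a * (-x) = a * x`; for a non-negative input only the first is active and the output is `x`. Whether a given trainable activation admits such a rewriting depends on its form, so this should be read as one instance rather than a general proof.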

Similar articles

A survey on modern trainable activation functions.

Neural Netw. 2021 Jun;138:14-32. doi: 10.1016/j.neunet.2021.01.026. Epub 2021 Feb 9.

Bayesian Optimization for Sparse Neural Networks With Trainable Activation Functions.

IEEE Trans Pattern Anal Mach Intell. 2024 Oct;46(10):6699-6712. doi: 10.1109/TPAMI.2024.3387073. Epub 2024 Sep 5.

Stochastic Selection of Activation Layers for Convolutional Neural Networks.

Sensors (Basel). 2020 Mar 14;20(6):1626. doi: 10.3390/s20061626.

PresB-Net: parametric binarized neural network with learnable activations and shuffled grouped convolution.

PeerJ Comput Sci. 2022 Jan 3;8:e842. doi: 10.7717/peerj-cs.842. eCollection 2022.

Reducing the U-Net size for practical scenarios: Virus recognition in electron microscopy images.

Comput Methods Programs Biomed. 2019 Sep;178:31-39. doi: 10.1016/j.cmpb.2019.05.026. Epub 2019 Jun 1.

A new type of neurons for machine learning.

Int J Numer Method Biomed Eng. 2018 Feb;34(2). doi: 10.1002/cnm.2920. Epub 2017 Sep 15.

Extreme learning machine for a new hybrid morphological/linear perceptron.

Neural Netw. 2020 Mar;123:288-298. doi: 10.1016/j.neunet.2019.12.003. Epub 2019 Dec 19.

Pedestrian attribute recognition using trainable Gabor wavelets.

Heliyon. 2021 Jun 30;7(6):e07422. doi: 10.1016/j.heliyon.2021.e07422. eCollection 2021 Jun.

An unsupervised parameter learning model for RVFL neural network.

Neural Netw. 2019 Apr;112:85-97. doi: 10.1016/j.neunet.2019.01.007. Epub 2019 Jan 28.

Interpretable and lightweight convolutional neural network for EEG decoding: Application to movement execution and imagination.

Neural Netw. 2020 Sep;129:55-74. doi: 10.1016/j.neunet.2020.05.032. Epub 2020 May 29.

Cited by

A Novel Hybrid Approach for Drowsiness Detection Using EEG Scalograms to Overcome Inter-Subject Variability.

Sensors (Basel). 2025 Sep 5;25(17):5530. doi: 10.3390/s25175530.

Predicting fertilizer treating of maize using digital image processing and deep learning approaches.

Sci Rep. 2025 Aug 17;15(1):30085. doi: 10.1038/s41598-025-98474-2.

A comparative study of neuro-fuzzy and neural network models in predicting length of stay in university hospital.

BMC Health Serv Res. 2025 Mar 27;25(1):446. doi: 10.1186/s12913-025-12623-x.

A new method for Tomicus classification of forest pests based on improved ResNet50 algorithm.

Sci Rep. 2025 Mar 20;15(1):9665. doi: 10.1038/s41598-025-93407-5.

Contrastive self-supervised learning for neurodegenerative disorder classification.

Front Neuroinform. 2025 Feb 17;19:1527582. doi: 10.3389/fninf.2025.1527582. eCollection 2025.

The Central Composite Design and Artificial Neural Network Coupled with Genetic Algorithm in Optimization and Modeling of the Radiolabeling Process of Lu-hydroxyapatite as a Potential Radiosynovectomy Agent.

Curr Radiopharm. 2025;18(3):201-215. doi: 10.2174/0118744710336283250227020659.

Prediction of Member Forces of Steel Tubes on the Basis of a Sensor System with the Use of AI.

Sensors (Basel). 2025 Feb 3;25(3):919. doi: 10.3390/s25030919.

Anomaly Detection Method for Harmonic Reducers with Only Healthy Data.

Sensors (Basel). 2024 Nov 21;24(23):7435. doi: 10.3390/s24237435.

Deep Complex Gated Recurrent Networks-Based IoT Network Intrusion Detection Systems.

Sensors (Basel). 2024 Sep 13;24(18):5933. doi: 10.3390/s24185933.

Identifying defects and varieties of Malting Barley Kernels.

Sci Rep. 2024 Sep 27;14(1):22143. doi: 10.1038/s41598-024-73683-3.
