• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ReLU 层的奇异值。

Singular Values for ReLU Layers.

出版信息

IEEE Trans Neural Netw Learn Syst. 2020 Sep;31(9):3594-3605. doi: 10.1109/TNNLS.2019.2945113. Epub 2019 Nov 5.

DOI:10.1109/TNNLS.2019.2945113
PMID:31714239
Abstract

Despite their prevalence in neural networks, we still lack a thorough theoretical characterization of rectified linear unit (ReLU) layers. This article aims to further our understanding of ReLU layers by studying how the activation function ReLU interacts with the linear component of the layer and what role this interaction plays in the success of the neural network in achieving its intended task. To this end, we introduce two new tools: ReLU singular values of operators and the Gaussian mean width of operators. By presenting, on the one hand, theoretical justifications, results, and interpretations of these two concepts and, on the other hand, numerical experiments and results of the ReLU singular values and the Gaussian mean width being applied to trained neural networks, we hope to give a comprehensive, singular-value-centric view of ReLU layers. We find that ReLU singular values and the Gaussian mean width do not only enable theoretical insights but also provide one with metrics that seem promising for practical applications. In particular, these measures can be used to distinguish correctly and incorrectly classified data as it traverses the network. We conclude by introducing two tools based on our findings: double layers and harmonic pruning.

摘要

尽管在神经网络中很常见,但我们仍然缺乏对修正线性单元 (ReLU) 层的全面理论描述。本文旨在通过研究 ReLU 激活函数如何与层的线性部分相互作用,以及这种相互作用在神经网络成功完成其预期任务中的作用,进一步了解 ReLU 层。为此,我们引入了两个新工具:算子的 ReLU 奇异值和算子的高斯平均宽度。一方面,我们提出了这两个概念的理论依据、结果和解释,另一方面,我们进行了数值实验,并将 ReLU 奇异值和高斯平均宽度应用于训练好的神经网络,希望能从奇异值的角度全面、深入地了解 ReLU 层。我们发现,ReLU 奇异值和高斯平均宽度不仅可以提供理论上的深入理解,还为实际应用提供了有前途的指标。特别是,这些度量可以用于区分网络中正确和错误分类的数据。最后,我们基于研究结果引入了两个工具:双层和调和剪枝。

相似文献

1
Singular Values for ReLU Layers.ReLU 层的奇异值。
IEEE Trans Neural Netw Learn Syst. 2020 Sep;31(9):3594-3605. doi: 10.1109/TNNLS.2019.2945113. Epub 2019 Nov 5.
2
Neural networks with ReLU powers need less depth.ReLU 激活函数的神经网络需要的深度更小。
Neural Netw. 2024 Apr;172:106073. doi: 10.1016/j.neunet.2023.12.027. Epub 2023 Dec 19.
3
ReLU Networks Are Universal Approximators via Piecewise Linear or Constant Functions.ReLU 网络通过分段线性或常数函数实现通用逼近。
Neural Comput. 2020 Nov;32(11):2249-2278. doi: 10.1162/neco_a_01316. Epub 2020 Sep 18.
4
Random Sketching for Neural Networks With ReLU.ReLU 神经网络的随机草图。
IEEE Trans Neural Netw Learn Syst. 2021 Feb;32(2):748-762. doi: 10.1109/TNNLS.2020.2979228. Epub 2021 Feb 4.
5
Approximation of smooth functionals using deep ReLU networks.使用深度 ReLU 网络逼近光滑泛函。
Neural Netw. 2023 Sep;166:424-436. doi: 10.1016/j.neunet.2023.07.012. Epub 2023 Jul 18.
6
Role of Layers and Neurons in Deep Learning With the Rectified Linear Unit.具有整流线性单元的深度学习中各层和神经元的作用。
Cureus. 2021 Oct 18;13(10):e18866. doi: 10.7759/cureus.18866. eCollection 2021 Oct.
7
Convergence of deep convolutional neural networks.深度卷积神经网络的融合。
Neural Netw. 2022 Sep;153:553-563. doi: 10.1016/j.neunet.2022.06.031. Epub 2022 Jun 30.
8
Analytical Bounds on the Local Lipschitz Constants of ReLU Networks.ReLU网络局部Lipschitz常数的解析界
IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):13902-13913. doi: 10.1109/TNNLS.2023.3273228. Epub 2024 Oct 7.
9
Deep ReLU neural networks in high-dimensional approximation.高维逼近中的深度 ReLU 神经网络。
Neural Netw. 2021 Oct;142:619-635. doi: 10.1016/j.neunet.2021.07.027. Epub 2021 Jul 29.
10
Recursion Newton-Like Algorithm for l-ReLU Deep Neural Networks.
IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):5882-5896. doi: 10.1109/TNNLS.2021.3131406. Epub 2023 Sep 1.

引用本文的文献

1
DeepAptamer: Advancing high-affinity aptamer discovery with a hybrid deep learning model.深度适体:利用混合深度学习模型推进高亲和力适体发现。
Mol Ther Nucleic Acids. 2024 Dec 21;36(1):102436. doi: 10.1016/j.omtn.2024.102436. eCollection 2025 Mar 11.
2
Synergy quality assessment of muscle modules for determining learning performance using a realistic musculoskeletal model.使用逼真的肌肉骨骼模型评估肌肉模块的协同作用质量以确定学习表现
Front Comput Neurosci. 2024 May 30;18:1355855. doi: 10.3389/fncom.2024.1355855. eCollection 2024.
3
Identification of Unique Genetic Biomarkers of Various Subtypes of Glomerulonephritis Using Machine Learning and Deep Learning.
利用机器学习和深度学习鉴定各种肾小球肾炎亚型的独特遗传生物标志物。
Biomolecules. 2022 Sep 10;12(9):1276. doi: 10.3390/biom12091276.