

A comparison of deep networks with ReLU activation function and linear spline-type methods.

Affiliation

Leiden University, Mathematical Institute, Niels Bohrweg 1, 2333 CA Leiden, The Netherlands.

Publication information

Neural Netw. 2019 Feb;110:232-242. doi: 10.1016/j.neunet.2018.11.005. Epub 2018 Dec 4.

Abstract

Deep neural networks (DNNs) generate much richer function spaces than shallow networks. However, since the function spaces induced by shallow networks already have several approximation-theoretic drawbacks, this alone does not necessarily explain the success of deep networks. In this article we take another route and compare the expressive power of DNNs with ReLU activation function to linear spline methods. We show that MARS (multivariate adaptive regression splines) is improperly learnable by DNNs in the sense that for any given function that can be expressed as a MARS function with M parameters there exists a multilayer neural network with O(M log(M/ε)) parameters that approximates this function up to sup-norm error ε. We show a similar result for expansions with respect to the Faber-Schauder system. Based on this, we derive risk comparison inequalities that bound the statistical risk of fitting a neural network by the statistical risk of spline-based methods. This shows that deep networks perform better than, or only slightly worse than, the considered spline methods. We provide a constructive proof of the function approximation.
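The O(M log(M/ε)) parameter count is plausible because a ReLU unit is itself a linear-spline hinge: a unit with weight 1 and bias −t computes the MARS basis function (x − t)_+ exactly, so sums of hinges cost no approximation at all, while products of hinges require approximate multiplication inside the network, which in standard constructions costs only logarithmically many extra parameters in 1/ε. The sketch below is a minimal numpy illustration of that standard ingredient, a Yarotsky-style ReLU approximation of x ↦ x² on [0, 1] (and hence of products via xy = ((x+y)² − x² − y²)/2). It is a toy demonstration of the general technique, not the paper's exact construction.

import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

# A single ReLU unit with weight 1 and bias -t already equals the MARS
# hinge (x - t)_+, so linear combinations of hinges are one-hidden-layer
# ReLU networks with zero approximation error.  The nontrivial part is
# products of hinges, which need approximate multiplication built from
# an approximation of x^2.

def hat(x):
    # g(x) = 2*min(x, 1 - x) on [0, 1], written with two ReLU units.
    return 2.0 * relu(x) - 4.0 * relu(x - 0.5)

def relu_square(x, m):
    # Yarotsky-style approximation of x^2 on [0, 1]:
    #   x^2 ≈ x - sum_{s=1}^m g∘...∘g(x) / 4^s   (s-fold composition of g),
    # with sup-norm error at most 4^{-(m+1)}, so the depth/parameter count
    # grows only like log(1/eps) -- the source of the log(M/ε) factor.
    out = x.copy()
    g = x.copy()
    for s in range(1, m + 1):
        g = hat(g)
        out = out - g / 4.0 ** s
    return out

x = np.linspace(0.0, 1.0, 10001)
for m in (2, 4, 6, 8):
    err = np.max(np.abs(relu_square(x, m) - x ** 2))
    print(f"m={m}: sup-norm error {err:.2e}  (bound {4.0 ** -(m + 1):.2e})")

Running this prints errors that halve the stated bound 4^{-(m+1)} at each level, consistent with the logarithmic dependence on 1/ε claimed in the abstract.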

