Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge CB3 0WA, United Kingdom.
Department of Mathematics, University of Oslo, 0316 Oslo, Norway.
Proc Natl Acad Sci U S A. 2022 Mar 22;119(12):e2107151119. doi: 10.1073/pnas.2107151119. Epub 2022 Mar 16.
Deep learning (DL) has had unprecedented success and is now entering scientific computing with full force. However, current DL methods typically suffer from instability, even when universal approximation properties guarantee the existence of stable neural networks (NNs). We address this paradox by demonstrating basic well-conditioned problems in scientific computing where one can prove the existence of NNs with great approximation qualities; however, there does not exist any algorithm, even randomized, that can train (or compute) such a NN. For any positive integers K > 2 and L, there are cases where simultaneously 1) no randomized training algorithm can compute a NN correct to K digits with probability greater than 1/2; 2) there exists a deterministic training algorithm that computes a NN with K − 1 correct digits, but any such (even randomized) algorithm needs arbitrarily many training data; and 3) there exists a deterministic training algorithm that computes a NN with K − 2 correct digits using no more than L training samples. These results imply a classification theory describing conditions under which (stable) NNs with a given accuracy can be computed by an algorithm. We begin this theory by establishing sufficient conditions for the existence of algorithms that compute stable NNs in inverse problems. We introduce fast iterative restarted networks (FIRENETs), which we both prove and numerically verify are stable. Moreover, we prove that only O(|log(ϵ)|) layers are needed for an ϵ-accurate solution to the inverse problem.
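The O(|log(ϵ)|) layer count is what one expects from a linearly convergent, restarted iterative scheme unrolled into network layers. As an illustrative sketch only (the contraction factor ν and the per-block error reduction are expository assumptions, not quantities taken from the abstract): if each unrolled block of such a network reduces the reconstruction error by a fixed factor ν ∈ (0, 1), then after n blocks

\[ \|x_n - x^{*}\| \;\le\; \nu\,\|x_{n-1} - x^{*}\| \;\le\; \cdots \;\le\; \nu^{n}\,\|x_0 - x^{*}\|, \]

so reaching accuracy ϵ requires only

\[ n \;\ge\; \frac{\log\!\big(\|x_0 - x^{*}\|/\epsilon\big)}{\log(1/\nu)} \;=\; O\!\big(|\log(\epsilon)|\big) \]

layers; that is, under this assumption the depth grows logarithmically, not polynomially, in the target accuracy.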