• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

归一化保持:残差网络为何可以变得非常深?

Norm-Preservation: Why Residual Networks Can Become Extremely Deep?

出版信息

IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):3980-3990. doi: 10.1109/TPAMI.2020.2990339. Epub 2021 Oct 1.

DOI:10.1109/TPAMI.2020.2990339
PMID:32340937
Abstract

Augmenting neural networks with skip connections, as introduced in the so-called ResNet architecture, surprised the community by enabling the training of networks of more than 1,000 layers with significant performance gains. This paper deciphers ResNet by analyzing the effect of skip connections, and puts forward new theoretical results on the advantages of identity skip connections in neural networks. We prove that the skip connections in the residual blocks facilitate preserving the norm of the gradient, and lead to stable back-propagation, which is desirable from optimization perspective. We also show that, perhaps surprisingly, as more residual blocks are stacked, the norm-preservation of the network is enhanced. Our theoretical arguments are supported by extensive empirical evidence. Can we push for extra norm-preservation? We answer this question by proposing an efficient method to regularize the singular values of the convolution operator and making the ResNet's transition layers extra norm-preserving. Our numerical investigations demonstrate that the learning dynamics and the classification performance of ResNet can be improved by making it even more norm preserving. Our results and the introduced modification for ResNet, referred to as Procrustes ResNets, can be used as a guide for training deeper networks and can also inspire new deeper architectures.

摘要

残差网络(ResNet)通过引入跳跃连接,使得训练超过 1000 层的网络成为可能,并取得了显著的性能提升,这一突破令研究界感到惊讶。本文通过分析跳跃连接的作用来解析 ResNet,并提出了关于神经网络中恒等跳跃连接优势的新理论结果。我们证明了残差块中的跳跃连接有助于保持梯度的范数,从而实现稳定的反向传播,这从优化的角度来看是理想的。我们还表明,令人惊讶的是,随着堆叠的残差块数量的增加,网络的范数保持能力会增强。我们的理论论证得到了广泛的实证证据的支持。我们能否进一步推动范数保持呢?通过提出一种有效的正则化卷积算子奇异值的方法,使 ResNet 的转换层具有额外的范数保持能力,我们回答了这个问题。我们的数值研究表明,通过使 ResNet 具有更强的范数保持能力,可以改善其学习动态和分类性能。我们的结果和引入的 ResNet 改进方法(称为 Procrustes ResNets)可以作为训练更深网络的指导,并启发新的更深层次的架构。

相似文献

1
Norm-Preservation: Why Residual Networks Can Become Extremely Deep?归一化保持:残差网络为何可以变得非常深?
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):3980-3990. doi: 10.1109/TPAMI.2020.2990339. Epub 2021 Oct 1.
2
Improved Residual Network based on norm-preservation for visual recognition.基于范数保持的视觉识别改进残差网络。
Neural Netw. 2023 Jan;157:305-322. doi: 10.1016/j.neunet.2022.10.023. Epub 2022 Oct 28.
3
Efficient Skip Connections-Based Residual Network (ESRNet) for Brain Tumor Classification.用于脑肿瘤分类的基于高效跳跃连接的残差网络(ESRNet)
Diagnostics (Basel). 2023 Oct 17;13(20):3234. doi: 10.3390/diagnostics13203234.
4
Why Is Everyone Training Very Deep Neural Network With Skip Connections?为什么每个人都在训练带有跳跃连接的深度神经网络?
IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):5961-5975. doi: 10.1109/TNNLS.2021.3131813. Epub 2023 Sep 1.
5
Protein Contact Map Prediction Based on ResNet and DenseNet.基于 ResNet 和 DenseNet 的蛋白质接触图预测。
Biomed Res Int. 2020 Apr 6;2020:7584968. doi: 10.1155/2020/7584968. eCollection 2020.
6
SIRe-Networks: Convolutional neural networks architectural extension for information preservation via skip/residual connections and interlaced auto-encoders.SIRe-Networks:通过跳传/残差连接和交错自编码器实现信息保留的卷积神经网络架构扩展。
Neural Netw. 2022 Sep;153:386-398. doi: 10.1016/j.neunet.2022.06.030. Epub 2022 Jun 27.
7
Unit-Vise: Deep Shallow Unit-Vise Residual Neural Networks With Transition Layer For Expert Level Skin Cancer Classification.整体-局部:具有迁移层的深层浅层整体-局部残差神经网络,用于专家级别的皮肤癌分类。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Mar-Apr;19(2):1225-1234. doi: 10.1109/TCBB.2020.3039358. Epub 2022 Apr 1.
8
Multi-way backpropagation for training compact deep neural networks.多向反向传播训练紧凑的深度神经网络。
Neural Netw. 2020 Jun;126:250-261. doi: 10.1016/j.neunet.2020.03.001. Epub 2020 Mar 26.
9
Why ResNet Works? Residuals Generalize.为什么ResNet有效?残差能够泛化。
IEEE Trans Neural Netw Learn Syst. 2020 Dec;31(12):5349-5362. doi: 10.1109/TNNLS.2020.2966319. Epub 2020 Nov 30.
10
Robust and energy-efficient expression recognition based on improved deep ResNets.基于改进深度残差网络的鲁棒且节能的表情识别
Biomed Tech (Berl). 2019 Sep 25;64(5):519-528. doi: 10.1515/bmt-2018-0027.

引用本文的文献

1
A comprehensive comparative study of generative adversarial network architectures for synthetic computed tomography generation in the abdomen.用于腹部合成计算机断层扫描生成的生成对抗网络架构的全面比较研究。
Med Phys. 2025 Aug;52(8):e18038. doi: 10.1002/mp.18038.
2
Development and validation of a transformer-based deep learning model for predicting distant metastasis in non-small cell lung cancer using FDG PET/CT images.基于Transformer的深度学习模型的开发与验证,用于使用FDG PET/CT图像预测非小细胞肺癌的远处转移
Clin Transl Oncol. 2025 Aug 8. doi: 10.1007/s12094-025-04014-9.
3
Non-intrusive load monitoring based on time-enhanced multidimensional feature visualization.
基于时间增强多维特征可视化的非侵入式负载监测
Sci Rep. 2025 Feb 8;15(1):4800. doi: 10.1038/s41598-025-89191-x.
4
Deep learning model based on primary tumor to predict lymph node status in clinical stage IA lung adenocarcinoma: a multicenter study.基于原发性肿瘤的深度学习模型预测临床ⅠA期肺腺癌淋巴结状态的多中心研究
J Natl Cancer Cent. 2024 Feb 1;4(3):233-240. doi: 10.1016/j.jncc.2024.01.005. eCollection 2024 Sep.
5
ResNet and CWT Fusion: A New Paradigm for Optimized Heterogeneous Thin Reservoir Evaluation.残差网络(ResNet)与连续小波变换(CWT)融合:优化非均质薄储层评价的新范式
ACS Omega. 2024 Jan 12;9(4):4775-4791. doi: 10.1021/acsomega.3c08169. eCollection 2024 Jan 30.
6
Application of 1D ResNet for Multivariate Fault Detection on Semiconductor Manufacturing Equipment.一维残差网络在半导体制造设备多元故障检测中的应用。
Sensors (Basel). 2023 Nov 10;23(22):9099. doi: 10.3390/s23229099.
7
Prediction for blood lactate during exercise using an artificial intelligence-Enabled electrocardiogram: a feasibility study.使用人工智能心电图预测运动期间的血乳酸:一项可行性研究。
Front Physiol. 2023 Oct 25;14:1253598. doi: 10.3389/fphys.2023.1253598. eCollection 2023.
8
Analysis of super-enhancer using machine learning and its application to medical biology.基于机器学习的超级增强子分析及其在医学生物学中的应用。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad107.
9
Detection of Occluded Small Commodities Based on Feature Enhancement under Super-Resolution.基于超分辨率下特征增强的被遮挡小目标检测。
Sensors (Basel). 2023 Feb 22;23(5):2439. doi: 10.3390/s23052439.
10
When less is more powerful: Shapley value attributed ablation with augmented learning for practical time series sensor data classification.当少即是多时:Shapley 值归因消融与增强学习在实用时间序列传感器数据分类中的应用。
PLoS One. 2022 Nov 23;17(11):e0277975. doi: 10.1371/journal.pone.0277975. eCollection 2022.