• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

低阶项优先于 ResNet 及其变体和整个神经网络家族。

Low-degree term first in ResNet, its variants and the whole neural network family.

机构信息

School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China; Mine Digitization Engineering Research Centre of Ministry of Education of the People's Republic of China, Xuzhou 221116, China.

出版信息

Neural Netw. 2022 Apr;148:155-165. doi: 10.1016/j.neunet.2022.01.012. Epub 2022 Jan 24.

DOI:10.1016/j.neunet.2022.01.012
PMID:35134597
Abstract

To explain the working mechanism of ResNet and its variants, this paper proposes a novel argument of shallow subnetwork first (SSF), essentially low-degree term first (LDTF), which also applies to the whole neural network family. A neural network with shortcut connections behaves as an ensemble of a number of subnetworks of differing depths. Among the subnetworks, the shallow subnetworks are trained firstly, having great effects on the performance of the neural network. The shallow subnetworks roughly correspond to low-degree polynomials, while the deep subnetworks are opposite. Based on Taylor expansion, SSF is consistent with LDTF. ResNet is in line with Taylor expansion: shallow subnetworks are trained firstly to keep low-degree terms, avoiding overfitting; deep subnetworks try to maintain high-degree terms, ensuring high description capacity. Experiments on ResNets and DenseNets show that shallow subnetworks are trained firstly and play important roles in the training of the networks. The experiments also reveal the reason why DenseNets outperform ResNets: The subnetworks playing vital roles in the training of the former are shallower than those in the training of the latter. Furthermore, LDTF can also be used to explain the working mechanism of other ResNet variants (SE-ResNets and SK-ResNets), and the common phenomena occurring in many neural networks.

摘要

为了解释 ResNet 及其变体的工作机制,本文提出了一个新的论点,即浅子网络优先(SSF),本质上是低阶项优先(LDTF),这也适用于整个神经网络家族。具有捷径连接的神经网络表现为一系列不同深度子网的集合。在这些子网中,浅层子网首先被训练,对神经网络的性能有很大的影响。浅层子网大致对应于低阶多项式,而深层子网则相反。基于泰勒展开,SSF 与 LDTF 一致。ResNet 符合泰勒展开:首先训练浅层子网以保持低阶项,避免过拟合;深层子网则试图保持高阶项,以确保高描述能力。在 ResNets 和 DenseNets 上的实验表明,浅层子网首先被训练,并在网络的训练中发挥重要作用。实验还揭示了 DenseNets 优于 ResNets 的原因:在前一种网络的训练中起关键作用的子网比在后一种网络的训练中起关键作用的子网更浅。此外,LDTF 也可用于解释其他 ResNet 变体(SE-ResNets 和 SK-ResNets)以及许多神经网络中常见的现象的工作机制。

相似文献

1
Low-degree term first in ResNet, its variants and the whole neural network family.低阶项优先于 ResNet 及其变体和整个神经网络家族。
Neural Netw. 2022 Apr;148:155-165. doi: 10.1016/j.neunet.2022.01.012. Epub 2022 Jan 24.
2
Robust and energy-efficient expression recognition based on improved deep ResNets.基于改进深度残差网络的鲁棒且节能的表情识别
Biomed Tech (Berl). 2019 Sep 25;64(5):519-528. doi: 10.1515/bmt-2018-0027.
3
ResNet and CycleGAN for pulse shape discrimination of He-4 detector pulses: Recovering pulses conventional algorithms fail to label unanimously.ResNet 和 CycleGAN 用于氦-4 探测器脉冲的脉冲形状判别:恢复传统算法无法一致标记的脉冲。
Appl Radiat Isot. 2021 Oct;176:109819. doi: 10.1016/j.apradiso.2021.109819. Epub 2021 Jun 9.
4
Can neural networks benefit from objectives that encourage iterative convergent computations? A case study of ResNets and object classification.神经网络能否从鼓励迭代收敛计算的目标中受益?ResNets 和目标分类的案例研究。
PLoS One. 2024 Mar 21;19(3):e0293440. doi: 10.1371/journal.pone.0293440. eCollection 2024.
5
Evolutionary Shallowing Deep Neural Networks at Block Levels.在块级别上对深度神经网络进行演化。
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4635-4647. doi: 10.1109/TNNLS.2021.3059529. Epub 2022 Aug 31.
6
GSNFS: Gene subnetwork biomarker identification of lung cancer expression data.GSNFS:肺癌表达数据的基因子网生物标志物识别
BMC Med Genomics. 2016 Dec 5;9(Suppl 3):70. doi: 10.1186/s12920-016-0231-4.
7
A Critical Test of Deep Convolutional Neural Networks' Ability to Capture Recurrent Processing in the Brain Using Visual Masking.使用视觉掩蔽对深度卷积神经网络在大脑中捕获重复处理能力的关键测试。
J Cogn Neurosci. 2022 Nov 1;34(12):2390-2405. doi: 10.1162/jocn_a_01914.
8
A New Generation of ResNet Model Based on Artificial Intelligence and Few Data Driven and Its Construction in Image Recognition Model.基于人工智能和少量数据驱动的新一代 ResNet 模型及其在图像识别模型中的构建。
Comput Intell Neurosci. 2022 Mar 19;2022:5976155. doi: 10.1155/2022/5976155. eCollection 2022.
9
Deep Residual Networks for User Authentication via Hand-Object Manipulations.基于手-物操作的深度残差网络用户认证。
Sensors (Basel). 2021 Apr 23;21(9):2981. doi: 10.3390/s21092981.
10
Why ResNet Works? Residuals Generalize.为什么ResNet有效?残差能够泛化。
IEEE Trans Neural Netw Learn Syst. 2020 Dec;31(12):5349-5362. doi: 10.1109/TNNLS.2020.2966319. Epub 2020 Nov 30.

引用本文的文献

1
Machine Learning and Deep Learning Hybrid Approach Based on Muscle Imaging Features for Diagnosis of Esophageal Cancer.基于肌肉成像特征的机器学习与深度学习混合方法用于食管癌诊断
Diagnostics (Basel). 2025 Jul 8;15(14):1730. doi: 10.3390/diagnostics15141730.
2
PSFHSP-Net: an efficient lightweight network for identifying pubic symphysis-fetal head standard plane from intrapartum ultrasound images.PSFHSP-Net:一种用于识别产时超声图像中耻骨联合-胎儿头标准平面的高效轻量级网络。
Med Biol Eng Comput. 2024 Oct;62(10):2975-2986. doi: 10.1007/s11517-024-03111-1. Epub 2024 May 9.
3
Infodemic: Challenges and solutions in topic discovery and data process.
信息疫情:主题发现与数据处理中的挑战与解决方案
Arch Public Health. 2023 Sep 7;81(1):166. doi: 10.1186/s13690-023-01179-z.
4
[Advanced Faster RCNN: a non-contrast CT-based algorithm for detecting pancreatic lesions in multiple disease stages].[先进的更快区域卷积神经网络:一种基于非增强CT的多疾病阶段胰腺病变检测算法]
Nan Fang Yi Ke Da Xue Xue Bao. 2023 May 20;43(5):755-763. doi: 10.12122/j.issn.1673-4254.2023.05.11.
5
An overview of deep learning techniques for COVID-19 detection: methods, challenges, and future works.用于新冠病毒(COVID-19)检测的深度学习技术综述:方法、挑战及未来工作
Multimed Syst. 2023;29(3):1603-1627. doi: 10.1007/s00530-023-01083-0. Epub 2023 Mar 25.
6
Automatic Extraction of Power Lines from Aerial Images of Unmanned Aerial Vehicles.从无人机航空图像中自动提取电力线
Sensors (Basel). 2022 Aug 26;22(17):6431. doi: 10.3390/s22176431.