• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无限宽度双层ReLU神经网络的同伦松弛训练算法

Homotopy Relaxation Training Algorithms for Infinite-Width Two-Layer ReLU Neural Networks.

作者信息

Yang Yahong, Chen Qipin, Hao Wenrui

机构信息

Department of Mathematics, The Pennsylvania State University, University Park, State College, PA 16802, USA.

Amazon Prime Video, Seattle, MA 98109, USA.

出版信息

J Sci Comput. 2025 Feb;102(2). doi: 10.1007/s10915-024-02761-5. Epub 2025 Jan 3.

DOI:10.1007/s10915-024-02761-5
PMID:40365531
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12074661/
Abstract

In this paper, we present a novel training approach called the Homotopy Relaxation Training Algorithm (HRTA), aimed at accelerating the training process in contrast to traditional methods. Our algorithm incorporates two key mechanisms: one involves building a homotopy activation function that seamlessly connects the linear activation function with the activation function; the other technique entails relaxing the homotopy parameter to enhance the training refinement process. We have conducted an in-depth analysis of this novel method within the context of the neural tangent kernel (NTK), revealing significantly improved convergence rates. Our experimental results, especially when considering networks with larger widths, validate the theoretical conclusions. This proposed HRTA exhibits the potential for other activation functions and deep neural networks.

摘要

在本文中,我们提出了一种名为同伦松弛训练算法(HRTA)的新颖训练方法,旨在与传统方法相比加速训练过程。我们的算法包含两个关键机制:一个是构建一个同伦激活函数,该函数将线性激活函数与激活函数无缝连接;另一种技术是放松同伦参数以增强训练优化过程。我们在神经切线核(NTK)的背景下对这种新方法进行了深入分析,结果显示收敛速度有显著提高。我们的实验结果,特别是考虑宽度较大的网络时,验证了理论结论。所提出的HRTA在其他激活函数和深度神经网络方面展现出潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/77c64f29564b/nihms-2066696-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/0e677264de95/nihms-2066696-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/3c0ffd578032/nihms-2066696-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/87c8e5c079bb/nihms-2066696-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/39ac0da09afb/nihms-2066696-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/62072dc4a53e/nihms-2066696-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/122a8ea0ce7e/nihms-2066696-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/a4618b6947b3/nihms-2066696-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/fb98dcc52d10/nihms-2066696-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/77c64f29564b/nihms-2066696-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/0e677264de95/nihms-2066696-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/3c0ffd578032/nihms-2066696-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/87c8e5c079bb/nihms-2066696-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/39ac0da09afb/nihms-2066696-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/62072dc4a53e/nihms-2066696-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/122a8ea0ce7e/nihms-2066696-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/a4618b6947b3/nihms-2066696-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/fb98dcc52d10/nihms-2066696-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7720/12074661/77c64f29564b/nihms-2066696-f0009.jpg

相似文献

1
Homotopy Relaxation Training Algorithms for Infinite-Width Two-Layer ReLU Neural Networks.无限宽度双层ReLU神经网络的同伦松弛训练算法
J Sci Comput. 2025 Feb;102(2). doi: 10.1007/s10915-024-02761-5. Epub 2025 Jan 3.
2
Convergence of deep convolutional neural networks.深度卷积神经网络的融合。
Neural Netw. 2022 Sep;153:553-563. doi: 10.1016/j.neunet.2022.06.031. Epub 2022 Jun 30.
3
A homotopy training algorithm for fully connected neural networks.一种用于全连接神经网络的同伦训练算法。
Proc Math Phys Eng Sci. 2019 Nov;475(2231):20190662. doi: 10.1098/rspa.2019.0662. Epub 2019 Nov 13.
4
Random Sketching for Neural Networks With ReLU.ReLU 神经网络的随机草图。
IEEE Trans Neural Netw Learn Syst. 2021 Feb;32(2):748-762. doi: 10.1109/TNNLS.2020.2979228. Epub 2021 Feb 4.
5
Approximation of smooth functionals using deep ReLU networks.使用深度 ReLU 网络逼近光滑泛函。
Neural Netw. 2023 Sep;166:424-436. doi: 10.1016/j.neunet.2023.07.012. Epub 2023 Jul 18.
6
Predicting the outputs of finite deep neural networks trained with noisy gradients.预测使用噪声梯度训练的有限深度神经网络的输出。
Phys Rev E. 2021 Dec;104(6-1):064301. doi: 10.1103/PhysRevE.104.064301.
7
Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。
Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.
8
On a natural homotopy between linear and nonlinear single-layer networks.关于线性与非线性单层网络之间的自然同伦。
IEEE Trans Neural Netw. 1996;7(2):307-17. doi: 10.1109/72.485634.
9
Improved Linear Convergence of Training CNNs With Generalizability Guarantees: A One-Hidden-Layer Case.具有泛化保证的卷积神经网络训练的改进线性收敛性:单隐藏层情况
IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2622-2635. doi: 10.1109/TNNLS.2020.3007399. Epub 2021 Jun 2.
10
Simple, fast, and flexible framework for matrix completion with infinite width neural networks.具有无限宽度神经网络的矩阵完成的简单、快速和灵活框架。
Proc Natl Acad Sci U S A. 2022 Apr 19;119(16):e2115064119. doi: 10.1073/pnas.2115064119. Epub 2022 Apr 11.

引用本文的文献

1
Automatic Differentiation is Essential in Training Neural Networks for Solving Differential Equations.自动微分在训练用于求解微分方程的神经网络中至关重要。
J Sci Comput. 2025 Aug;104(2). doi: 10.1007/s10915-025-02965-3. Epub 2025 Jun 24.

本文引用的文献

1
Near-optimal deep neural network approximation for Korobov functions with respect to L and H norms.关于 L 和 H 范数的 Korobov 函数的近最优深度神经网络逼近。
Neural Netw. 2024 Dec;180:106702. doi: 10.1016/j.neunet.2024.106702. Epub 2024 Sep 6.
2
Locally adaptive activation functions with slope recovery for deep and physics-informed neural networks.用于深度和物理感知神经网络的具有斜率恢复的局部自适应激活函数。
Proc Math Phys Eng Sci. 2020 Jul;476(2239):20200334. doi: 10.1098/rspa.2020.0334. Epub 2020 Jul 15.
3
A homotopy training algorithm for fully connected neural networks.一种用于全连接神经网络的同伦训练算法。
Proc Math Phys Eng Sci. 2019 Nov;475(2231):20190662. doi: 10.1098/rspa.2019.0662. Epub 2019 Nov 13.