

Two-Phase Switching Optimization Strategy in Deep Neural Networks.

Publication Information

IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):330-339. doi: 10.1109/TNNLS.2020.3027750. Epub 2022 Jan 5.

Abstract

Optimization in a deep neural network is challenging due to the vanishing gradient problem and the intensive fine-tuning of network hyperparameters. Inspired by multistage decision control systems, the stochastic diagonal approximate greatest descent (SDAGD) algorithm is proposed in this article to seek optimal learning weights using a two-phase switching optimization strategy. The proposed optimizer controls the relative step length, derived from the long-term optimal trajectory, and adopts a diagonally approximated Hessian for efficient weight updates. In Phase-I, it computes the greatest step length at the boundary of each local spherical search region and subsequently descends rapidly toward an optimal solution. In Phase-II, it switches automatically to an approximate Newton method once it is close to the optimal solution to achieve fast convergence. The experiments show that SDAGD produces steeper learning curves and achieves lower misclassification rates compared with other optimization techniques. Application of the proposed optimizer to deeper networks is also investigated in this article to study the vanishing gradient problem.
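The abstract describes the two-phase switching behavior only in words. The sketch below is a minimal, hypothetical reconstruction of one SDAGD-style update, assuming the relative step length is tied to the gradient norm and the radius of the local spherical search region, and that the Hessian is replaced by its diagonal. The function and parameter names are illustrative and are not taken from the authors' implementation.

```python
import numpy as np

def sdagd_step(w, grad, diag_hess, radius=1.0, eps=1e-8):
    """One illustrative SDAGD-style update (hypothetical sketch, not the authors' code).

    Phase-I : far from a minimum, the relative step-length term mu dominates,
              so the step is effectively limited by the radius of the local
              spherical search region.
    Phase-II: near a minimum, the gradient norm (and hence mu) shrinks, and
              the update approaches a diagonal approximate-Newton step.
    """
    # Relative step-length control derived from the gradient norm and the search
    # radius (assumed form; the paper derives it from the long-term optimal trajectory).
    mu = np.linalg.norm(grad) / radius
    # A diagonal approximate Hessian turns the "matrix inverse" into an
    # element-wise division.
    return w - grad / (np.abs(diag_hess) + mu + eps)

# Toy usage on a separable quadratic f(w) = 0.5 * sum(a_i * w_i**2),
# whose gradient is a * w and whose Hessian diagonal is a.
a = np.array([10.0, 1.0, 0.1])
w = np.array([5.0, -3.0, 8.0])
for _ in range(50):
    w = sdagd_step(w, grad=a * w, diag_hess=a, radius=1.0)
print(w)  # should approach the minimizer at the origin
```

On this toy quadratic, the early iterations behave like Phase-I (steps bounded by the search radius because mu is large), while later iterations approach a diagonal Newton step as the gradient norm shrinks, mirroring the automatic switching described in the abstract.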

