Korea Research Institute of Chemical Technology (KRICT), Republic of Korea.
Neural Netw. 2022 Jun;150:326-335. doi: 10.1016/j.neunet.2022.02.014. Epub 2022 Feb 25.
This paper proposes a new hierarchical approach to learning rate adaptation in gradient methods, called learning rate optimization (LRO). LRO formulates the learning rate adaptation problem as a hierarchical optimization problem that minimizes the loss function with respect to the learning rate for the current model parameters and gradients. LRO then optimizes the learning rate based on the alternating direction method of multipliers (ADMM). In this learning rate optimization, LRO requires neither second-order information nor a probabilistic model, so it is highly efficient. Furthermore, LRO introduces no additional hyperparameters compared to the vanilla gradient method with simple exponential learning rate decay. In the experiments, we integrated LRO with vanilla SGD and Adam, and compared their optimization performance with state-of-the-art learning rate adaptation methods as well as the most commonly used adaptive gradient methods. SGD and Adam with LRO outperformed all competitors on the benchmark datasets in image classification tasks.
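The abstract does not spell out the ADMM update rules, but the hierarchical idea it describes, treating the learning rate itself as an optimization variable that minimizes the loss for the current parameters and gradient, can be sketched in a few lines. The following Python sketch illustrates that bilevel structure on a toy least-squares loss; it replaces the paper's ADMM solver with a few first-order inner steps on the learning rate, and every name here (toy_loss, toy_grad, lro_sgd_step, eta_lr, inner_steps) is an illustrative assumption rather than the authors' implementation.

```python
# Minimal sketch of the hierarchical idea behind learning rate optimization (LRO):
# at each outer step, choose the learning rate eta to (approximately) minimize
# L(theta - eta * g) for the current parameters theta and gradient g.
# The inner problem is solved here with plain scalar gradient descent on eta,
# NOT the paper's ADMM-based solver; the point is only that first-order
# information suffices, as the abstract states.

import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(20, 5))
b = rng.normal(size=20)

def toy_loss(theta):
    """Least-squares loss 0.5 * ||A theta - b||^2 (stand-in for a network loss)."""
    r = A @ theta - b
    return 0.5 * float(r @ r)

def toy_grad(theta):
    return A.T @ (A @ theta - b)

def lro_sgd_step(theta, eta, inner_steps=5, eta_lr=1e-4):
    """One SGD step whose learning rate is itself optimized (hierarchical / bilevel)."""
    g = toy_grad(theta)
    for _ in range(inner_steps):
        # d/d_eta L(theta - eta * g) = -g . grad L(theta - eta * g): first-order info only.
        d_eta = -float(g @ toy_grad(theta - eta * g))
        eta = max(eta - eta_lr * d_eta, 0.0)  # keep the step size non-negative
    return theta - eta * g, eta

theta = np.zeros(5)
eta = 0.01
for step in range(50):
    theta, eta = lro_sgd_step(theta, eta)
print(f"final loss {toy_loss(theta):.4f}, adapted eta {eta:.4f}")
```

In this sketch the learning rate carries over between outer steps, so it adapts gradually toward a value suited to the local curvature; how the actual method couples the inner ADMM solve with SGD and Adam is detailed in the paper, not here.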