Suppr超能文献

通过分数梯度加速梯度下降和Adam算法。

Accelerating gradient descent and Adam via fractional gradients.

作者信息

Shin Yeonjong, Darbon Jérôme, Karniadakis George Em

机构信息

Department of Mathematical Sciences, KAIST, Daejeon 34141, South Korea.

Division of Applied Mathematics, Brown University, Providence, RI 02912, USA.

出版信息

Neural Netw. 2023 Apr;161:185-201. doi: 10.1016/j.neunet.2023.01.002. Epub 2023 Jan 11.

Abstract

We propose a class of novel fractional-order optimization algorithms. We define a fractional-order gradient via the Caputo fractional derivatives that generalizes integer-order gradient. We refer it to as the Caputo fractional-based gradient, and develop an efficient implementation to compute it. A general class of fractional-order optimization methods is then obtained by replacing integer-order gradients with the Caputo fractional-based gradients. To give concrete algorithms, we consider gradient descent (GD) and Adam, and extend them to the Caputo fractional GD (CfGD) and the Caputo fractional Adam (CfAdam). We demonstrate the superiority of CfGD and CfAdam on several large scale optimization problems that arise from scientific machine learning applications, such as ill-conditioned least squares problem on real-world data and the training of neural networks involving non-convex objective functions. Numerical examples show that both CfGD and CfAdam result in acceleration over GD and Adam, respectively. We also derive error bounds of CfGD for quadratic functions, which further indicate that CfGD could mitigate the dependence on the condition number in the rate of convergence and results in significant acceleration over GD.

摘要

我们提出了一类新颖的分数阶优化算法。我们通过Caputo分数阶导数定义了分数阶梯度,它是整数阶梯度的推广。我们将其称为基于Caputo分数阶的梯度,并开发了一种有效的计算方法。然后,通过用基于Caputo分数阶的梯度替换整数阶梯度,得到了一类通用的分数阶优化方法。为了给出具体算法,我们考虑梯度下降(GD)和Adam,并将它们扩展为Caputo分数阶梯度下降(CfGD)和Caputo分数阶Adam(CfAdam)。我们在科学机器学习应用中出现的几个大规模优化问题上展示了CfGD和CfAdam的优越性,例如真实世界数据上的病态最小二乘问题以及涉及非凸目标函数的神经网络训练。数值例子表明,CfGD和CfAdam分别比GD和Adam有加速效果。我们还推导了二次函数的CfGD误差界,这进一步表明CfGD可以减轻收敛速度对条件数的依赖,并比GD有显著加速。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验