Suppr超能文献

用于深度神经网络的带动量和能量的分数阶随机梯度下降法

Fractional-order stochastic gradient descent method with momentum and energy for deep neural networks.

作者信息

Zhou Xingwen, You Zhenghao, Sun Weiguo, Zhao Dongdong, Yan Shi

机构信息

School of Information Science and Engineering, Lanzhou University, 222 Tianshui South Road, Chengguan District, Lanzhou, Gansu Province, Lanzhou, 730000, Gansu, China; School of Nuclear Science and Technology, Lanzhou University, 222 Tianshui South Road, Chengguan District, Lanzhou, Gansu Province, Lanzhou, 730000, Gansu, China.

School of Information Science and Engineering, Lanzhou University, 222 Tianshui South Road, Chengguan District, Lanzhou, Gansu Province, Lanzhou, 730000, Gansu, China.

出版信息

Neural Netw. 2025 Jan;181:106810. doi: 10.1016/j.neunet.2024.106810. Epub 2024 Oct 19.

Abstract

In this paper, a novel fractional-order stochastic gradient descent with momentum and energy (FOSGDME) approach is proposed. Specifically, to address the challenge of converging to a real extreme point encountered by the existing fractional gradient algorithms, a novel fractional-order stochastic gradient descent (FOSGD) method is presented by modifying the definition of the Caputo fractional-order derivative. A FOSGD with moment (FOSGDM) is established by incorporating momentum information to accelerate the convergence speed and accuracy further. In addition, to improve the robustness and accuracy, a FOSGD with moment and energy is established by further introducing energy formation. The extensive experimental results on the image classification CIFAR-10 dataset obtained with ResNet and DenseNet demonstrate that the proposed FOSGD, FOSGDM and FOSGDME algorithms are superior to the integer order optimization algorithms, and achieve state-of-the-art performance.

摘要

本文提出了一种新颖的带动量和能量的分数阶随机梯度下降(FOSGDME)方法。具体而言,为应对现有分数梯度算法在收敛到实际极值点时遇到的挑战,通过修改Caputo分数阶导数的定义,提出了一种新颖的分数阶随机梯度下降(FOSGD)方法。通过纳入动量信息进一步建立了带动量的FOSGD(FOSGDM),以加快收敛速度和提高收敛精度。此外,为提高鲁棒性和精度,通过进一步引入能量形式建立了带动量和能量的FOSGD。在使用ResNet和DenseNet的图像分类CIFAR-10数据集上获得的大量实验结果表明,所提出的FOSGD、FOSGDM和FOSGDME算法优于整数阶优化算法,并取得了最优性能。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验