Zhou Xingwen, You Zhenghao, Sun Weiguo, Zhao Dongdong, Yan Shi
School of Information Science and Engineering, Lanzhou University, 222 Tianshui South Road, Chengguan District, Lanzhou, Gansu Province, Lanzhou, 730000, Gansu, China; School of Nuclear Science and Technology, Lanzhou University, 222 Tianshui South Road, Chengguan District, Lanzhou, Gansu Province, Lanzhou, 730000, Gansu, China.
School of Information Science and Engineering, Lanzhou University, 222 Tianshui South Road, Chengguan District, Lanzhou, Gansu Province, Lanzhou, 730000, Gansu, China.
Neural Netw. 2025 Jan;181:106810. doi: 10.1016/j.neunet.2024.106810. Epub 2024 Oct 19.
In this paper, a novel fractional-order stochastic gradient descent with momentum and energy (FOSGDME) approach is proposed. Specifically, to address the challenge of converging to a real extreme point encountered by the existing fractional gradient algorithms, a novel fractional-order stochastic gradient descent (FOSGD) method is presented by modifying the definition of the Caputo fractional-order derivative. A FOSGD with moment (FOSGDM) is established by incorporating momentum information to accelerate the convergence speed and accuracy further. In addition, to improve the robustness and accuracy, a FOSGD with moment and energy is established by further introducing energy formation. The extensive experimental results on the image classification CIFAR-10 dataset obtained with ResNet and DenseNet demonstrate that the proposed FOSGD, FOSGDM and FOSGDME algorithms are superior to the integer order optimization algorithms, and achieve state-of-the-art performance.
本文提出了一种新颖的带动量和能量的分数阶随机梯度下降(FOSGDME)方法。具体而言,为应对现有分数梯度算法在收敛到实际极值点时遇到的挑战,通过修改Caputo分数阶导数的定义,提出了一种新颖的分数阶随机梯度下降(FOSGD)方法。通过纳入动量信息进一步建立了带动量的FOSGD(FOSGDM),以加快收敛速度和提高收敛精度。此外,为提高鲁棒性和精度,通过进一步引入能量形式建立了带动量和能量的FOSGD。在使用ResNet和DenseNet的图像分类CIFAR-10数据集上获得的大量实验结果表明,所提出的FOSGD、FOSGDM和FOSGDME算法优于整数阶优化算法,并取得了最优性能。