
Distributed Stochastic Gradient Tracking Algorithm With Variance Reduction for Non-Convex Optimization.

Author Information

Jiang Xia, Zeng Xianlin, Sun Jian, Chen Jie

Publication Information

IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):5310-5321. doi: 10.1109/TNNLS.2022.3170944. Epub 2023 Sep 1.

DOI: 10.1109/TNNLS.2022.3170944
PMID: 35536804
Abstract

This article proposes a distributed stochastic algorithm with variance reduction for general smooth non-convex finite-sum optimization, which has wide applications in the signal processing and machine learning communities. In the distributed setting, a large number of samples are allocated to multiple agents in the network. Each agent computes a local stochastic gradient and communicates with its neighbors to seek the global optimum. In this article, we develop a modified variance reduction technique to deal with the variance introduced by stochastic gradients. Combining gradient tracking and variance reduction techniques, this article proposes a distributed stochastic algorithm, the gradient tracking algorithm with variance reduction (GT-VR), to solve large-scale non-convex finite-sum optimization over multiagent networks. A complete and rigorous proof shows that the GT-VR algorithm converges to first-order stationary points at an O(1/k) convergence rate. In addition, we provide a complexity analysis of the proposed algorithm. Compared with some existing first-order methods, the proposed algorithm has a lower O(PMε⁻¹) gradient complexity under mild conditions. By comparing GT-VR with state-of-the-art algorithms in numerical simulations, we verify the efficiency of the proposed algorithm.
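
For orientation, here is a minimal NumPy sketch of the generic template the abstract describes: each agent i keeps a local iterate x_i and a gradient tracker y_i, updated as x_i ← Σ_j w_ij x_j − α y_i and y_i ← Σ_j w_ij y_j + g_i(new) − g_i(old), where g_i is an SVRG-style variance-reduced local stochastic gradient. The toy non-convex sample loss, ring-network mixing matrix, snapshot period, and step size below are illustrative assumptions, not the paper's exact GT-VR construction or parameters.

```python
import numpy as np

# Sketch of distributed stochastic gradient tracking with SVRG-style
# variance reduction, in the spirit of the GT-VR template. Problem:
# min_x (1/P) sum_i F_i(x), with F_i(x) = (1/M) sum_m f_{i,m}(x).

rng = np.random.default_rng(0)
P, M, d = 5, 50, 10              # agents, samples per agent, dimension
alpha, T, snap_period = 0.05, 300, 25  # step size, iterations, snapshot period (assumed)

# Toy local data: agent i holds M pairs (A[i, m], b[i, m]).
A = rng.normal(size=(P, M, d))
b = rng.normal(size=(P, M))

def grad(i, m, x):
    """Gradient of a smooth non-convex sample loss (illustrative choice):
    f_{i,m}(x) = log(1 + (a^T x - b)^2) / 2."""
    r = A[i, m] @ x - b[i, m]
    return (r / (1.0 + r * r)) * A[i, m]

def full_grad(i, x):
    """Full local gradient over agent i's M samples."""
    return np.mean([grad(i, m, x) for m in range(M)], axis=0)

# Doubly stochastic mixing matrix for a ring network (assumed topology).
W = np.zeros((P, P))
for i in range(P):
    W[i, i] = 0.5
    W[i, (i - 1) % P] = 0.25
    W[i, (i + 1) % P] = 0.25

x = np.tile(rng.normal(size=d), (P, 1))  # local iterates x_i (common start)
snap_x = x.copy()                         # SVRG snapshot points
snap_g = np.stack([full_grad(i, snap_x[i]) for i in range(P)])

def vr_grad(i):
    """SVRG-style variance-reduced local stochastic gradient estimator."""
    m = rng.integers(M)
    return grad(i, m, x[i]) - grad(i, m, snap_x[i]) + snap_g[i]

g = np.stack([vr_grad(i) for i in range(P)])
y = g.copy()                              # gradient trackers y_i, init y_i = g_i

for k in range(1, T + 1):
    x = W @ x - alpha * y                 # consensus step + tracked descent
    if k % snap_period == 0:              # periodically refresh snapshots
        snap_x = x.copy()
        snap_g = np.stack([full_grad(i, snap_x[i]) for i in range(P)])
    g_new = np.stack([vr_grad(i) for i in range(P)])
    y = W @ y + g_new - g                 # gradient tracking recursion
    g = g_new

# Stationarity proxy at the network average: ||(1/P) sum_i grad F_i(x_bar)||.
x_bar = x.mean(axis=0)
avg_grad = np.mean([full_grad(i, x_bar) for i in range(P)], axis=0)
print("||avg grad|| at consensus average:", np.linalg.norm(avg_grad))
```

Because each tracker y_i is a running consensus estimate of the global gradient, the variance-reduced increments g_new − g are what allow a constant step size here; a plain stochastic gradient in the same recursion would require diminishing steps.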

Similar Articles

1. Distributed Stochastic Gradient Tracking Algorithm With Variance Reduction for Non-Convex Optimization.
   IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):5310-5321. doi: 10.1109/TNNLS.2022.3170944. Epub 2023 Sep 1.
2. Distributed Stochastic Proximal Algorithm With Random Reshuffling for Nonsmooth Finite-Sum Optimization.
   IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):4082-4096. doi: 10.1109/TNNLS.2022.3201711. Epub 2024 Feb 29.
3. Stochastic Strongly Convex Optimization via Distributed Epoch Stochastic Gradient Algorithm.
   IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2344-2357. doi: 10.1109/TNNLS.2020.3004723. Epub 2021 Jun 2.
4. A multivariate adaptive gradient algorithm with reduced tuning efforts.
   Neural Netw. 2022 Aug;152:499-509. doi: 10.1016/j.neunet.2022.05.016. Epub 2022 May 21.
5. Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds.
   IEEE Trans Pattern Anal Mach Intell. 2021 Feb;43(2):459-472. doi: 10.1109/TPAMI.2019.2933841. Epub 2021 Jan 8.
6. Shuffling-type gradient method with bandwidth-based step sizes for finite-sum optimization.
   Neural Netw. 2024 Nov;179:106514. doi: 10.1016/j.neunet.2024.106514. Epub 2024 Jul 6.
7. Painless Stochastic Conjugate Gradient for Large-Scale Machine Learning.
   IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):14645-14658. doi: 10.1109/TNNLS.2023.3280826. Epub 2024 Oct 7.
8. Improved Variance Reduction Methods for Riemannian Non-Convex Optimization.
   IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7610-7623. doi: 10.1109/TPAMI.2021.3112139. Epub 2022 Oct 4.
9. On Consensus-Optimality Trade-offs in Collaborative Deep Learning.
   Front Artif Intell. 2021 Sep 14;4:573731. doi: 10.3389/frai.2021.573731. eCollection 2021.
10. Variance Reduced Methods for Non-Convex Composition Optimization.
   IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5813-5825. doi: 10.1109/TPAMI.2021.3071594. Epub 2022 Aug 4.