Spiridonoff Artin, Olshevsky Alex, Paschalidis Ioannis Ch.
Division of Systems Engineering, Boston University, Boston, MA 02215, USA.
J Mach Learn Res. 2020;21.
We consider the standard model of distributed optimization of a sum of functions F(z) = f_1(z) + ... + f_n(z), where node i in a network holds the function f_i(z). We allow for a harsh network model characterized by asynchronous updates, message delays, unpredictable message losses, and directed communication among nodes. In this setting, we analyze a modification of the Gradient-Push method for distributed optimization, assuming that (i) node i is capable of generating gradients of its function f_i(z) corrupted by zero-mean bounded-support additive noise at each step, (ii) F(z) is strongly convex, and (iii) each f_i(z) has Lipschitz gradients. We show that our proposed method asymptotically performs as well as the best bounds on centralized gradient descent that takes steps in the direction of the sum of the noisy gradients of all the functions f_1(z), ..., f_n(z) at each step.
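To make the setting concrete, the following is a minimal, synchronous Python sketch of the Gradient-Push (push-sum gradient) idea the abstract builds on, with noisy local gradients on a directed graph. It is not the paper's robust asynchronous algorithm: there are no delays, message losses, or asynchronous updates here, and the quadratic local functions, the graph, and the step-size schedule are illustrative assumptions chosen so the script runs end to end.

import numpy as np

rng = np.random.default_rng(0)
n = 5                                    # number of nodes
# Directed ring with one extra edge; out_neighbors[i] includes i (self-loop),
# so the graph is strongly connected and push-sum weights stay positive.
out_neighbors = [[i, (i + 1) % n] for i in range(n)]
out_neighbors[0].append(2)
out_deg = [len(nbrs) for nbrs in out_neighbors]

# Local functions f_i(z) = 0.5 * a_i * ||z - b_i||^2: their sum F is strongly
# convex and each f_i has Lipschitz gradients, matching assumptions (ii)-(iii).
d = 3
a = rng.uniform(1.0, 3.0, size=n)
b = rng.normal(size=(n, d))
z_star = (a[:, None] * b).sum(axis=0) / a.sum()   # minimizer of the sum F

def noisy_grad(i, z):
    # Local gradient corrupted by zero-mean, bounded-support additive noise,
    # as in assumption (i).
    return a[i] * (z - b[i]) + rng.uniform(-0.1, 0.1, size=d)

x = np.zeros((n, d))     # push-sum numerators
y = np.ones(n)           # push-sum weights
z = x / y[:, None]       # local estimates z_i = x_i / y_i

for t in range(1, 5001):
    alpha = 1.0 / t                       # diminishing step size
    new_x = np.zeros_like(x)
    new_y = np.zeros(n)
    # Each node "pushes" equal shares of (x_i, y_i) to its out-neighbors.
    for i in range(n):
        for j in out_neighbors[i]:
            new_x[j] += x[i] / out_deg[i]
            new_y[j] += y[i] / out_deg[i]
    z = new_x / new_y[:, None]
    # Each node then takes a local noisy gradient step on its numerator.
    for i in range(n):
        new_x[i] -= alpha * noisy_grad(i, z[i])
    x, y = new_x, new_y

print("consensus error :", np.max(np.linalg.norm(z - z.mean(axis=0), axis=1)))
print("distance to z*  :", np.linalg.norm(z.mean(axis=0) - z_star))

Under these assumptions, all local estimates z_i reach consensus and approach the minimizer of F; the paper's contribution is showing that a suitably modified version of this scheme retains such guarantees, at an asymptotically centralized rate, even under asynchrony, delays, and message losses.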