Department of Mechanical Engineering, Iowa State University, Ames, 50011, IA, USA; Department of Computer Science, Iowa State University, Ames, 50011, IA, USA.
Department of Mechanical Engineering, Iowa State University, Ames, 50011, IA, USA.
Neural Netw. 2024 Mar;171:25-39. doi: 10.1016/j.neunet.2023.11.057. Epub 2023 Nov 27.
Decentralized deep learning algorithms leverage peer-to-peer communication of model parameters and/or gradients over communication graphs among the learning agents with access to their private data sets. The majority of the studies in this area focus on achieving high accuracy, with many at the expense of increased communication overhead among the agents. However, large peer-to-peer communication overhead often becomes a practical challenge, especially in harsh environments such as for an underwater sensor network. In this paper, we aim to reduce communication overhead while achieving similar performance as the state-of-the-art algorithms. To achieve this, we use the concept of Minimum Connected Dominating Set from graph theory that is applied in ad hoc wireless networks to address communication overhead issues. Specifically, we propose a new decentralized deep learning algorithm called minimum connected Dominating Set Model Aggregation (DSMA). We investigate the efficacy of our method for different communication graph topologies with a small to large number of agents using varied neural network model architectures. Empirical results on benchmark data sets show a significant (up to 100X) reduction in communication time while preserving the accuracy or in some cases, increasing it compared to the state-of-the-art methods. We also present an analysis to show the convergence of our proposed algorithm.
去中心化深度学习算法利用对等通信,通过学习代理之间的通信图来共享模型参数和/或梯度,这些代理可以访问其私有数据集。该领域的大多数研究都集中在提高准确性上,许多研究都以增加代理之间的通信开销为代价。然而,大型的对等通信开销往往成为一个实际的挑战,特别是在水下传感器网络等恶劣环境中。在本文中,我们旨在在实现与现有最先进算法相似性能的同时,降低通信开销。为了实现这一目标,我们使用图论中的最小连通支配集概念,该概念应用于自组织无线网络来解决通信开销问题。具体来说,我们提出了一种新的去中心化深度学习算法,称为最小连通支配集模型聚合(DSMA)。我们使用不同的神经网络模型架构,针对具有少量到大量代理的不同通信图拓扑结构,研究了我们方法的有效性。基准数据集上的实验结果表明,与最先进的方法相比,我们的方法在保持准确性的情况下(在某些情况下甚至提高了准确性),通信时间显著减少(高达 100 倍)。我们还提出了一种分析方法来展示我们提出的算法的收敛性。