College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China.
Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking (IPCAN), Hangzhou 310027, China.
Sensors (Basel). 2022 Jan 28;22(3):1030. doi: 10.3390/s22031030.
As an emerging paradigm addressing data privacy and transmission efficiency, decentralized learning aims to acquire a global model using training data distributed over many user devices. This is a challenging problem, since link loss, partial device participation, and non-independent and identically distributed (non-iid) data all degrade the performance of decentralized learning algorithms. Existing work is often restricted to linear models or performs poorly on non-iid data. Therefore, in this paper, we propose a decentralized learning scheme based on distributed parallel stochastic gradient descent (DPSGD) and a graph neural network (GNN) to deal with the above challenges. Specifically, each user device participating in the learning task uses its local training data to compute local stochastic gradients and update its own local model. Then, each device exchanges model parameters with its neighbors and applies the GNN model to approach the average of the local models, i.e., the global model. These iterations repeat until the algorithm converges. Extensive simulation results over both iid and non-iid data validate the algorithm's convergence to near-optimal results and its robustness to both link loss and partial device participation.
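The training loop described in the abstract (a local stochastic gradient step on each device, followed by a parameter exchange with graph neighbors) can be sketched as follows. This is only a minimal illustration under assumed details, not the paper's implementation: the GNN-based aggregation is replaced here by a fixed doubly stochastic mixing matrix over a ring graph, a linear least-squares model stands in for the learned model, and all names (`local_gradient`, `W`, the shard construction) are hypothetical.

```python
# Minimal DPSGD sketch: local SGD step per device, then neighbor averaging.
# Hypothetical setup; the paper's GNN-based aggregation is replaced by a
# plain doubly stochastic mixing matrix on a ring graph.
import numpy as np

rng = np.random.default_rng(0)

# Each of N devices holds a private, shifted (hence non-iid) data shard.
N, d, samples_per_device = 6, 10, 50
w_true = rng.normal(size=d)
devices = []
for i in range(N):
    X = rng.normal(loc=0.2 * i, size=(samples_per_device, d))
    y = X @ w_true + 0.1 * rng.normal(size=samples_per_device)
    devices.append((X, y))

# Ring communication graph with a doubly stochastic mixing matrix W.
W = np.zeros((N, N))
for i in range(N):
    W[i, i] = W[i, (i - 1) % N] = W[i, (i + 1) % N] = 1.0 / 3.0

def local_gradient(w, X, y, batch_size=8):
    """Stochastic gradient of the local least-squares loss."""
    idx = rng.choice(len(y), size=batch_size, replace=False)
    Xb, yb = X[idx], y[idx]
    return Xb.T @ (Xb @ w - yb) / batch_size

models = np.zeros((N, d))  # one parameter vector per device
lr = 0.05
for t in range(400):
    # 1) each device updates its own model from its local data
    for i, (X, y) in enumerate(devices):
        models[i] -= lr * local_gradient(models[i], X, y)
    # 2) each device mixes parameters with its graph neighbors
    models = W @ models

print("consensus error:", np.linalg.norm(models - models.mean(axis=0)))
print("distance to w_true:", np.linalg.norm(models.mean(axis=0) - w_true))
```

In this sketch the mixing step drives the per-device models toward their average, which is the role the abstract assigns to the GNN-based parameter exchange; handling link loss and partial participation would require modifying the mixing step accordingly.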