Fu Lele, Huang Sheng, Li Yuecheng, Chen Chuan, Zhang Chuanfu, Zheng Zibin
School of Systems Science and Engineering, Sun Yat-sen University, Guangzhou, China.
School of Systems Science and Engineering, Sun Yat-sen University, Guangzhou, China.
Neural Netw. 2025 Jul;187:107319. doi: 10.1016/j.neunet.2025.107319. Epub 2025 Mar 5.
Federated learning collaborates with multiple clients to train a global model, enhancing the model generalization while allowing the local data transmission-free and security. However, federated learning currently faces three intractable challenges: (1) The large number of model parameters result in an excessive communication burden. (2) The non-independently and identically distributed local data induces the degradation of global model. (3) The model heterogeneity renders traditional federated aggregation infeasible. To dissipate the three difficulties, we propose to learn the global prompt in the low-rank tensor space (FedGPT) for heterogeneous federated learning. Specifically, we employ the prompts rather than the model parameters as the carrier of local knowledge to achieve the information interaction between multiple clients. Since the prompts only have a very small number of variables, the communication volume is greatly reduced. To cope with the data heterogeneity, the prompts from different clients are stacked into the third-order tensors, on which the tensor singular value decomposition is performed to extract the global information. Furthermore, the proposed FedGPT possesses the ability to handle the model heterogeneity, the local models of different sizes can transfer the knowledge with the help of the prompts to improve the performance. Extensive experiments on three real-world datasets are conducted. Overall, FedGPT outperforms other state-of-the-art compared methods by up to 13.21%, and achieves less than 3% of communication volume of FedAvg, demonstrating the superiority of the proposed FedGPT.
联邦学习与多个客户端协作训练全局模型,在保证本地数据无传输且安全的同时提高模型的泛化能力。然而,联邦学习目前面临三个棘手的挑战:(1)大量的模型参数导致通信负担过重。(2)非独立同分布的本地数据会导致全局模型性能下降。(3)模型的异质性使得传统的联邦聚合方法不可行。为了解决这三个难题,我们提出在低秩张量空间中学习全局提示(FedGPT)用于异构联邦学习。具体而言,我们采用提示而非模型参数作为本地知识的载体,以实现多个客户端之间的信息交互。由于提示仅包含非常少量的变量,通信量大大减少。为了应对数据异质性,将来自不同客户端的提示堆叠成三阶张量,并对其进行张量奇异值分解以提取全局信息。此外,所提出的FedGPT具有处理模型异质性的能力,不同大小的本地模型可以借助提示传递知识以提高性能。我们在三个真实世界数据集上进行了广泛的实验。总体而言,FedGPT比其他最先进的比较方法性能提升高达13.21%,并且通信量不到FedAvg的3%,证明了所提出的FedGPT的优越性。