Shawkat Mai, Ali Zainab H, Salem Mofreh, El-Desoky Ali
Department of Electronics and Communications Engineering, Mansoura Higher Institute for Engineering and Technology, Mansoura, Egypt.
Department of Embedded Network Systems and Technology, Faculty of Artificial Intelligence, Kafrelsheikh University, El-Geish St, Kafrelsheikh, 33516, Egypt.
Sci Rep. 2025 Apr 23;15(1):14069. doi: 10.1038/s41598-025-96468-8.
Federated learning (FL) is a promising approach that addresses privacy, and scalability concerns in contrast to traditional centralized methods. Challenges such as personalization and data heterogeneity issues remain critical. Clustered federated learning (CFL) has been proposed as a promising approach to alleviate these issues by establishing specialized global models for sets of similar users. Although CFL enhances adaptability to highly statistically heterogeneous environments, it may suffer from real-time distribution changes due to limitations in fixed cluster configurations. This study presents the robust model of personalized federated distillation (RMPFD), a personalized and privacy-enhanced framework. The RMPFD framework employs an adaptive hierarchical clustering strategy to generate semi-global models by grouping clients with similar data distributions, allowing them to train independently. Meta-learning is used in each cluster to enhance the personalization of the local models and the classification accuracy of the non-independent and Identically distributed (non-IID) data distributions. Experimental evaluations conducted on CIFAR- 10, CIFAR- 100, Fashion-MNIST and Enron email datasets reveal that RMPFD reduces communication overhead by approximately 15% and 20%, compared to Federated Averaging (FedAvg) and other baseline techniques. Moreover, the RMPFD framework improves the convergence rates and classification accuracy, leading to an improvement of over 12% in performance compared to traditional FL methods.
联邦学习(FL)是一种很有前景的方法,与传统的集中式方法相比,它解决了隐私和可扩展性问题。诸如个性化和数据异质性问题等挑战仍然至关重要。聚类联邦学习(CFL)已被提出作为一种有前景的方法,通过为相似用户集建立专门的全局模型来缓解这些问题。尽管CFL增强了对高度统计异质环境的适应性,但由于固定聚类配置的限制,它可能会受到实时分布变化的影响。本研究提出了个性化联邦蒸馏的鲁棒模型(RMPFD),这是一个个性化且增强隐私的框架。RMPFD框架采用自适应分层聚类策略,通过对具有相似数据分布的客户端进行分组来生成半全局模型,使它们能够独立训练。在每个聚类中使用元学习来增强局部模型的个性化以及非独立同分布(non-IID)数据分布的分类准确性。在CIFAR-10、CIFAR-100、Fashion-MNIST和安然电子邮件数据集上进行的实验评估表明,与联邦平均(FedAvg)和其他基线技术相比,RMPFD将通信开销降低了约15%和20%。此外,RMPFD框架提高了收敛速度和分类准确性,与传统的联邦学习方法相比,性能提高了超过12%。