Li Pengyu, Guo Chengwei, Xing Yanxia, Shi Yingji, Feng Lei, Zhou Fanqin
6G Research Center, China Telecom Research Institute, Beijing, 102209, China.
State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, 100876, China.
Sci Rep. 2024 Feb 26;14(1):4663. doi: 10.1038/s41598-024-53193-y.
Wireless traffic prediction is vital for intelligent cellular network operations, such as load-aware resource management and predictive control. Traditional centralized training addresses this but poses issues like excessive data transmission, disregarding delays, and user privacy. Traditional federated learning methods can meet the requirement of jointly training models while protecting the privacy of all parties' data. However, challenges arise when the local data features among participating parties exhibit inconsistency, making the training process difficult to sustain. Our study introduces an innovative framework for wireless traffic prediction based on split learning (SL) and vertical federated learning. Multiple edge clients collaboratively train high-quality prediction models by utilizing diverse traffic data while maintaining the confidentiality of raw data locally. Each participant individually trains dimension-specific prediction models with their respective data, and the outcomes are aggregated through collaboration. A partially global model is formed and shared among clients to address statistical heterogeneity in distributed machine learning. Extensive experiments on real-world datasets demonstrate our method's superiority over current approaches, showcasing its potential for network traffic prediction and accurate forecasting.
无线流量预测对于智能蜂窝网络运营至关重要,例如负载感知资源管理和预测控制。传统的集中式训练解决了这一问题,但带来了诸如数据传输过多、忽视延迟和用户隐私等问题。传统的联邦学习方法可以在保护各方数据隐私的同时满足联合训练模型的要求。然而,当参与方之间的本地数据特征表现出不一致时,就会出现挑战,使训练过程难以持续。我们的研究引入了一种基于分裂学习(SL)和纵向联邦学习的无线流量预测创新框架。多个边缘客户端通过利用不同的流量数据协作训练高质量的预测模型,同时在本地保持原始数据的保密性。每个参与者使用各自的数据单独训练特定维度的预测模型,并通过协作汇总结果。形成一个部分全局模型并在客户端之间共享,以解决分布式机器学习中的统计异质性问题。在真实世界数据集上进行的大量实验证明了我们的方法优于当前方法,展示了其在网络流量预测和准确预报方面的潜力。