IEEE J Biomed Health Inform. 2024 Jun;28(6):3349-3360. doi: 10.1109/JBHI.2023.3306425. Epub 2024 Jun 6.
The issue of data privacy protection must be considered in distributed federated learning (FL) so as to ensure that sensitive information is not leaked. In this article, we propose a two-stage differential privacy (DP) framework for FL based on edge intelligence. Various levels of privacy preservation can be provided according to the degree of data sensitivity. In the first stage, the randomized response mechanism is used to perturb the original feature data by the user terminal for data desensitization, and the user can self-regulate the level of privacy preservation. In the second stage, noise is added to the local models by the edge server to further guarantee the privacy of the models. Finally, the model updates are aggregated in the cloud. In order to evaluate the performance of the proposed end-edge-cloud FL framework in terms of training accuracy and convergence, extensive experiments are conducted on a real electrocardiogram (ECG) signal dataset. Bi-directional long-short-term memory (BiLSTM) neural network is adopted to training classification model. The effect of different combinations of feature perturbation and noise addition on the model accuracy is analyzed depending on different privacy budgets and parameters. The experimental results demonstrate that the proposed privacy-preserving framework provides good accuracy and convergence while ensuring privacy.
在分布式联邦学习 (FL) 中必须考虑数据隐私保护问题,以确保敏感信息不被泄露。本文提出了一种基于边缘智能的 FL 两阶段差分隐私 (DP) 框架。根据数据敏感度的不同,可以提供不同级别的隐私保护。在第一阶段,用户终端使用随机响应机制对原始特征数据进行扰动,实现数据脱敏,用户可以自行调节隐私保护级别。在第二阶段,边缘服务器向本地模型添加噪声,进一步保证模型的隐私性。最后,在云端聚合模型更新。为了评估所提出的端-边-云联邦学习框架在训练准确性和收敛性方面的性能,在真实的心电图 (ECG) 信号数据集上进行了广泛的实验。采用双向长短时记忆 (BiLSTM) 神经网络训练分类模型。根据不同的隐私预算和参数,分析了不同特征扰动和噪声添加组合对模型准确性的影响。实验结果表明,所提出的隐私保护框架在确保隐私的同时,提供了良好的准确性和收敛性。