Li Zhaohua, Wang L, Chen Guangyao, Zhang Zhiqiang, Shafiq Muhammad, Gu Zhaoquan
IEEE J Biomed Health Inform. 2023 Feb;27(2):756-767. doi: 10.1109/JBHI.2022.3204455. Epub 2023 Feb 3.
A plethora of healthcare data is produced every day due to the proliferation of prominent technologies such as Internet of Medical Things (IoMT). Digital-driven smart devices like wearable watches, wristbands and bracelets are utilized extensively in modern healthcare applications. Mining valuable information from the data distributed at the owners' level is useful, but it is challenging to preserve data privacy. Federated learning (FL) has swiftly surged in popularity due to its efficacy in dealing privacy vulnerabilities. Recent studies have demonstrated that Gradient Inversion Attack (GIA) can reconstruct the input data by leaked gradients, previous work demonstrated the achievement of GIA in very limited scenarios, such as the label repetition rate of the target sample being low and batch sizes being smaller than 48. In this paper, a novel method of End-to-End Gradient Inversion (E2EGI) is proposed. Compared to the state-of-the-art method, E2EGI's Minimum Loss Combinatorial Optimization (MLCO) has the ability to realize reconstructed samples with higher similarity, and the Distributed Gradient Inversion algorithm can implement GIA with batch sizes of 8 to 256 on deep network models (such as ResNet-50) and ImageNet datasets. A new Label Reconstruction algorithm is developed that relies only on the gradient information of the target model, which can achieve a label reconstruction accuracy of 81% in one batch sample with a label repetition rate of 96%, a 27% improvement over the state-of-the-art method. This proposed work can underpin data security assessments for healthcare federated learning.
由于诸如医疗物联网(IoMT)等卓越技术的激增,每天都会产生大量的医疗保健数据。像可穿戴手表、腕带和手环等数字驱动的智能设备在现代医疗保健应用中得到了广泛应用。从所有者层面分布的数据中挖掘有价值的信息很有用,但保护数据隐私具有挑战性。联邦学习(FL)因其在处理隐私漏洞方面的有效性而迅速受到欢迎。最近的研究表明,梯度反转攻击(GIA)可以通过泄露的梯度来重建输入数据,先前的工作表明GIA仅在非常有限的场景中取得了成果,例如目标样本的标签重复率较低且批量大小小于48。本文提出了一种端到端梯度反转(E2EGI)的新方法。与最先进的方法相比,E2EGI的最小损失组合优化(MLCO)能够实现相似度更高的重建样本,并且分布式梯度反转算法可以在深度网络模型(如ResNet-50)和ImageNet数据集上以8到256的批量大小实现GIA。开发了一种仅依赖目标模型梯度信息的新标签重建算法,在标签重复率为96%的一批样本中,该算法可以实现81%的标签重建准确率,比最先进的方法提高了27%。这项提议的工作可以为医疗保健联邦学习的数据安全评估提供支持。