Gao Xiaoyuan, Mi Wei, Feng Xirui
College of Humanities and Law, Tianjin University of Science and Technology, Tianjin, 300222, China.
School of Law, Tianjin University, Tianjin, 300072, China.
Sci Rep. 2025 May 13;15(1):16558. doi: 10.1038/s41598-025-01575-1.
With the rapid advancement of intelligent healthcare, the privacy protection of personal health data has become a critical issue that urgently needs to be addressed. To tackle this challenge, this work proposes a Differential Privacy-based Generative Adversarial Network for Healthcare Data (DP-GAN-HD), which integrates Generative Adversarial Networks (GANs) with differential privacy mechanisms to ensure secure data publishing. The method addresses the challenge of efficiently publishing personal health data under privacy protection. The proposed method employs a multi-generator architecture and optimizes generator parameters through gradient clipping and genetic algorithms, enhancing data privacy protection and the quality and utility of the generated data. Experimental results show that, with a privacy budget of 2.0, the accuracy of DP-GAN-HD on the Adult, Br2000, and Kaggle Cardiovascular Disease datasets reaches 0.784, 0.800, and 0.823, respectively. They all outperform other differential privacy models and are slightly lower than the real datasets, demonstrating a strong balance between privacy protection and data utility. Additionally, the model's accuracy gradually improves as the privacy budget increases. When a privacy budget is 3.0, DP-GAN-HD achieves its peak, performing nearly identically to real data. DP-GAN-HD demonstrates enhanced resistance to privacy attacks through its multi-generator framework and Gaussian noise perturbation mechanisms. These features collectively reduce privacy leakage risks while maintaining an effective balance between data utility and protection. Overall, the experimental results reveal that DP-GAN-HD excels in balancing privacy protection and data utility across different datasets, and prove its adaptability and effectiveness in intelligent healthcare applications.
随着智能医疗的迅速发展,个人健康数据的隐私保护已成为一个迫切需要解决的关键问题。为应对这一挑战,本文提出了一种基于差分隐私的医疗数据生成对抗网络(DP-GAN-HD),该网络将生成对抗网络(GANs)与差分隐私机制相结合,以确保安全的数据发布。该方法解决了在隐私保护下高效发布个人健康数据的挑战。所提出的方法采用多生成器架构,并通过梯度裁剪和遗传算法优化生成器参数,增强了数据隐私保护以及生成数据的质量和实用性。实验结果表明,在隐私预算为2.0的情况下,DP-GAN-HD在成人、Br2000和Kaggle心血管疾病数据集上的准确率分别达到0.784、0.800和0.823。它们均优于其他差分隐私模型,且略低于真实数据集,这表明在隐私保护和数据实用性之间实现了良好的平衡。此外,随着隐私预算的增加,该模型的准确率逐渐提高。当隐私预算为3.0时,DP-GAN-HD达到峰值,性能与真实数据几乎相同。DP-GAN-HD通过其多生成器框架和高斯噪声扰动机制展示了对隐私攻击的增强抵抗力。这些特性共同降低了隐私泄露风险,同时在数据实用性和保护之间保持了有效的平衡。总体而言,实验结果表明DP-GAN-HD在不同数据集上平衡隐私保护和数据实用性方面表现出色,并证明了其在智能医疗应用中的适应性和有效性。