• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

医疗保健中的隐私保护:对用于合成数据生成的深度学习方法的系统综述。

Preserving privacy in healthcare: A systematic review of deep learning approaches for synthetic data generation.

作者信息

Liu Yintong, Acharya U Rajendra, Tan Jen Hong

机构信息

NUS-ISS, 25 Heng Mui Keng Terrace, Singapore, 119615, Singapore.

Centre for Health Research, University of Southern Queensland, Springfield, Australia; School of Mathematics, Physics and Computing, University of Southern Queensland, Springfield, Australia.

出版信息

Comput Methods Programs Biomed. 2025 Mar;260:108571. doi: 10.1016/j.cmpb.2024.108571. Epub 2024 Dec 28.

DOI:10.1016/j.cmpb.2024.108571
PMID:39742693
Abstract

BACKGROUND

Data sharing in healthcare is vital for advancing research and personalized medicine. However, the process is hindered by privacy, ethical, and legal challenges associated with patient data. Synthetic data generation emerges as a promising solution, replicating statistical properties of real data while enhancing privacy protection.

METHODS

This systematic review examines deep learning techniques for synthetic data generation in healthcare, focusing on their ability to maintain data utility and enhance privacy. Studies from Scopus, Web of Science, PubMed, and IEEE databases published between 2019 and 2023 were analyzed. Key methods explored include Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), and Diffusion Models. Evaluation metrics encompass data resemblance, utility, and privacy preservation, with special attention to privacy-enhancing methods like differential privacy and federated learning.

RESULTS

GANs and VAEs demonstrated robust capabilities in generating realistic synthetic data for tabular, signal, image, and multi-modal datasets. Privacy-preserving approaches such as differential privacy and adversarial training significantly reduced re-identification risks while maintaining data fidelity. However, challenges persist in preserving temporal correlations, reducing biases, and aligning with regulatory frameworks, particularly for longitudinal and high-dimensional data.

CONCLUSION

Synthetic data generation holds significant potential for privacy-preserving data sharing in healthcare. Ongoing research is required to develop advanced algorithms and evaluation frameworks, ensuring synthetic data's quality and privacy. Collaboration between technologists and policymakers is essential to create comprehensive guidelines, fostering secure and effective data sharing in healthcare.

摘要

背景

医疗保健中的数据共享对于推进研究和个性化医疗至关重要。然而,这一过程受到与患者数据相关的隐私、伦理和法律挑战的阻碍。合成数据生成作为一种有前景的解决方案出现,它在增强隐私保护的同时复制真实数据的统计特性。

方法

本系统综述研究了医疗保健中用于合成数据生成的深度学习技术,重点关注其保持数据效用和增强隐私的能力。分析了2019年至2023年期间在Scopus、科学网、PubMed和IEEE数据库上发表的研究。探索的关键方法包括生成对抗网络(GAN)、变分自编码器(VAE)和扩散模型。评估指标包括数据相似性、效用和隐私保护,特别关注差分隐私和联邦学习等增强隐私的方法。

结果

GAN和VAE在为表格、信号、图像和多模态数据集生成逼真的合成数据方面表现出强大的能力。差分隐私和对抗训练等隐私保护方法在保持数据保真度的同时显著降低了重新识别风险。然而,在保留时间相关性、减少偏差以及与监管框架保持一致方面仍然存在挑战,特别是对于纵向和高维数据。

结论

合成数据生成在医疗保健中保护隐私的数据共享方面具有巨大潜力。需要持续开展研究以开发先进的算法和评估框架,确保合成数据的质量和隐私。技术专家和政策制定者之间的合作对于创建全面的指导方针至关重要,有助于促进医疗保健中安全有效的数据共享。

相似文献

1
Preserving privacy in healthcare: A systematic review of deep learning approaches for synthetic data generation.医疗保健中的隐私保护:对用于合成数据生成的深度学习方法的系统综述。
Comput Methods Programs Biomed. 2025 Mar;260:108571. doi: 10.1016/j.cmpb.2024.108571. Epub 2024 Dec 28.
2
On the Fidelity-Privacy Tradeoff of Synthetic Cancer Registry Data.合成癌症登记数据的保真度-隐私权衡。
Stud Health Technol Inform. 2024 Aug 22;316:621-625. doi: 10.3233/SHTI240490.
3
The urgent need to accelerate synthetic data privacy frameworks for medical research.加速医学研究合成数据隐私框架的迫切需求。
Lancet Digit Health. 2025 Feb;7(2):e157-e160. doi: 10.1016/S2589-7500(24)00196-1. Epub 2024 Nov 26.
4
Personal health data protection and intelligent healthcare applications under generative adversarial network.生成对抗网络下的个人健康数据保护与智能医疗应用
Sci Rep. 2025 May 13;15(1):16558. doi: 10.1038/s41598-025-01575-1.
5
Data Obfuscation Through Latent Space Projection for Privacy-Preserving AI Governance: Case Studies in Medical Diagnosis and Finance Fraud Detection.通过潜在空间投影进行数据混淆以实现隐私保护的人工智能治理:医学诊断和金融欺诈检测案例研究
JMIRx Med. 2025 Mar 12;6:e70100. doi: 10.2196/70100.
6
Utility-based Analysis of Statistical Approaches and Deep Learning Models for Synthetic Data Generation With Focus on Correlation Structures: Algorithm Development and Validation.基于效用的统计方法和深度学习模型用于合成数据生成的分析,重点关注相关结构:算法开发与验证
JMIR AI. 2025 Mar 20;4:e65729. doi: 10.2196/65729.
7
Comprehensive evaluation framework for synthetic tabular data in health: fidelity, utility and privacy analysis of generative models with and without privacy guarantees.健康领域合成表格数据的综合评估框架:具有和不具有隐私保障的生成模型的保真度、效用和隐私分析
Front Digit Health. 2025 Apr 24;7:1576290. doi: 10.3389/fdgth.2025.1576290. eCollection 2025.
8
Privacy-Preserving Generative Deep Neural Networks Support Clinical Data Sharing.隐私保护生成式深度神经网络支持临床数据共享。
Circ Cardiovasc Qual Outcomes. 2019 Jul;12(7):e005122. doi: 10.1161/CIRCOUTCOMES.118.005122. Epub 2019 Jul 9.
9
Federated learning with differential privacy for breast cancer diagnosis enabling secure data sharing and model integrity.用于乳腺癌诊断的具有差分隐私的联邦学习,实现安全的数据共享和模型完整性。
Sci Rep. 2025 Apr 16;15(1):13061. doi: 10.1038/s41598-025-95858-2.
10
Privacy-Preserving Federated Learning Framework for Multi-Source Electronic Health Records Prognosis Prediction.用于多源电子健康记录预后预测的隐私保护联邦学习框架
Sensors (Basel). 2025 Apr 9;25(8):2374. doi: 10.3390/s25082374.

引用本文的文献

1
Enhancing security in electromagnetic radiation therapy using fuzzy graph theory.运用模糊图论增强电磁辐射治疗中的安全性。
Sci Rep. 2025 Apr 16;15(1):13139. doi: 10.1038/s41598-025-98110-z.