• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

加速医学研究合成数据隐私框架的迫切需求。

The urgent need to accelerate synthetic data privacy frameworks for medical research.

作者信息

Arora Anmol, Wagner Siegfried Karl, Carpenter Robin, Jena Rajesh, Keane Pearse A

机构信息

School of Clinical Medicine, University of Cambridge, Cambridge, UK.

NIHR Biomedical Research Centre, Moorfields Eye Hospital NHS Foundation Trust, London, UK; Institute of Ophthalmology, University College London, London, UK.

出版信息

Lancet Digit Health. 2025 Feb;7(2):e157-e160. doi: 10.1016/S2589-7500(24)00196-1. Epub 2024 Nov 26.

DOI:10.1016/S2589-7500(24)00196-1
PMID:39603900
Abstract

Synthetic data, generated through artificial intelligence technologies such as generative adversarial networks and latent diffusion models, maintain aggregate patterns and relationships present in the real data the technologies were trained on without exposing individual identities, thereby mitigating re-identification risks. This approach has been gaining traction in biomedical research because of its ability to preserve privacy and enable dataset sharing between organisations. Although the use of synthetic data has become widespread in other domains, such as finance and high-energy physics, use in medical research raises novel issues. The use of synthetic data as a method of preserving the privacy of data used to train models requires that the data are high fidelity with the original data to preserve utility, but must be sufficiently different as to protect against adversarial or accidental re-identification. There is a need for the development of standards for synthetic data generation and consensus standards for its evaluation. As synthetic data applications expand, ongoing legal and ethical evaluations are crucial to ensure that they remain a secure and effective tool for advancing medical research without compromising individual privacy.

摘要

通过生成对抗网络和潜在扩散模型等人工智能技术生成的合成数据,保留了用于训练这些技术的真实数据中存在的总体模式和关系,同时不暴露个体身份,从而降低了重新识别风险。由于这种方法能够保护隐私并实现组织间的数据集共享,因此在生物医学研究中越来越受到关注。尽管合成数据的使用在金融和高能物理等其他领域已广泛普及,但在医学研究中的应用引发了一些新问题。将合成数据用作保护模型训练所用数据隐私的一种方法,要求数据与原始数据具有高保真度以保持实用性,但又必须有足够差异以防止对抗性或意外的重新识别。需要制定合成数据生成标准及其评估的共识标准。随着合成数据应用的扩展,持续的法律和伦理评估对于确保其在不损害个人隐私的情况下仍然是推进医学研究的安全有效工具至关重要。

相似文献

1
The urgent need to accelerate synthetic data privacy frameworks for medical research.加速医学研究合成数据隐私框架的迫切需求。
Lancet Digit Health. 2025 Feb;7(2):e157-e160. doi: 10.1016/S2589-7500(24)00196-1. Epub 2024 Nov 26.
2
Preserving privacy in healthcare: A systematic review of deep learning approaches for synthetic data generation.医疗保健中的隐私保护:对用于合成数据生成的深度学习方法的系统综述。
Comput Methods Programs Biomed. 2025 Mar;260:108571. doi: 10.1016/j.cmpb.2024.108571. Epub 2024 Dec 28.
3
Data Obfuscation Through Latent Space Projection for Privacy-Preserving AI Governance: Case Studies in Medical Diagnosis and Finance Fraud Detection.通过潜在空间投影进行数据混淆以实现隐私保护的人工智能治理:医学诊断和金融欺诈检测案例研究
JMIRx Med. 2025 Mar 12;6:e70100. doi: 10.2196/70100.
4
On the Fidelity-Privacy Tradeoff of Synthetic Cancer Registry Data.合成癌症登记数据的保真度-隐私权衡。
Stud Health Technol Inform. 2024 Aug 22;316:621-625. doi: 10.3233/SHTI240490.
5
Synthetic Health Data: Real Ethical Promise and Peril.合成健康数据:真实的伦理承诺与危险。
Hastings Cent Rep. 2024 Sep;54(5):8-13. doi: 10.1002/hast.4911.
6
Tunable Privacy Risk Evaluation of Generative Adversarial Networks.生成式对抗网络的可调隐私风险评估。
Stud Health Technol Inform. 2024 Aug 22;316:1233-1237. doi: 10.3233/SHTI240634.
7
Creating High Fidelity Synthetic Pelvis Radiographs Using Generative Adversarial Networks: Unlocking the Potential of Deep Learning Models Without Patient Privacy Concerns.利用生成对抗网络生成高保真骨盆 X 射线:在不涉及患者隐私问题的情况下挖掘深度学习模型的潜力。
J Arthroplasty. 2023 Oct;38(10):2037-2043.e1. doi: 10.1016/j.arth.2022.12.013. Epub 2022 Dec 17.
8
Securing a Generative AI-Powered Healthcare Chatbot.保障生成式 AI 赋能的医疗保健聊天机器人。
Stud Health Technol Inform. 2024 Nov 22;321:195-199. doi: 10.3233/SHTI241091.
9
Data stewardship and curation practices in AI-based genomics and automated microscopy image analysis for high-throughput screening studies: promoting robust and ethical AI applications.基于人工智能的基因组学和用于高通量筛选研究的自动显微镜图像分析中的数据管理与整理实践:推动可靠且符合伦理的人工智能应用。
Hum Genomics. 2025 Feb 23;19(1):16. doi: 10.1186/s40246-025-00716-x.
10
[Expert consensus on ethical requirements for artificial intelligence (AI) processing medical data].[人工智能(AI)处理医学数据的伦理要求专家共识]
Sheng Li Xue Bao. 2024 Dec 25;76(6):937-942.

引用本文的文献

1
Synthetic data can benefit medical research - but risks must be recognized.合成数据可为医学研究带来益处——但必须认识到其中的风险。
Nature. 2025 Sep;645(8080):283. doi: 10.1038/d41586-025-02869-0.
2
Levelling up as a fair solution in AI enabled cancer screening.在人工智能辅助癌症筛查中,将公平性提升作为一种合理的解决方案。
Front Digit Health. 2025 Feb 25;7:1540982. doi: 10.3389/fdgth.2025.1540982. eCollection 2025.
3
Augmenting Insufficiently Accruing Oncology Clinical Trials Using Generative Models: Validation Study.使用生成模型增强入组不足的肿瘤学临床试验:验证研究
J Med Internet Res. 2025 Mar 5;27:e66821. doi: 10.2196/66821.