• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种数据驱动的方法,用于在差分隐私下为临床试验数据共享选择隐私参数。

A data-driven approach to choosing privacy parameters for clinical trial data sharing under differential privacy.

机构信息

Study Design and Data Analysis, College of Public Health, University of South Florida, Tampa, FL 33612, United States.

Department of Applied and Computational Mathematics and Statistics, University of Notre Dame, Notre Dame, IN 46556, United States.

出版信息

J Am Med Inform Assoc. 2024 Apr 19;31(5):1135-1143. doi: 10.1093/jamia/ocae038.

DOI:10.1093/jamia/ocae038
PMID:38457282
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11031247/
Abstract

OBJECTIVES

Clinical trial data sharing is crucial for promoting transparency and collaborative efforts in medical research. Differential privacy (DP) is a formal statistical technique for anonymizing shared data that balances privacy of individual records and accuracy of replicated results through a "privacy budget" parameter, ε. DP is considered the state of the art in privacy-protected data publication and is underutilized in clinical trial data sharing. This study is focused on identifying ε values for the sharing of clinical trial data.

MATERIALS AND METHODS

We analyzed 2 clinical trial datasets with privacy budget ε ranging from 0.01 to 10. Smaller values of ε entail adding greater amounts of random noise, with better privacy as a result. Comparison of rates, odds ratios, means, and mean differences between the original clinical trial datasets and the empirical distribution of the DP estimator was performed.

RESULTS

The DP rate closely approximated the original rate of 6.5% when ε > 1. The DP odds ratio closely aligned with the original odds ratio of 0.689 when ε ≥ 3. The DP mean closely approximated the original mean of 164.64 when ε ≥ 1. As ε increased to 5, both the minimum and maximum DP means converged toward the original mean.

DISCUSSION

There is no consensus on how to choose the privacy budget ε. The definition of DP does not specify the required level of privacy, and there is no established formula for determining ε.

CONCLUSION

Our findings suggest that the application of DP holds promise in the context of sharing clinical trial data.

摘要

目的

临床试验数据共享对于促进医学研究的透明度和协作至关重要。差分隐私(DP)是一种通过“隐私预算”参数 ε 对共享数据进行匿名化的正式统计技术,该参数在平衡个体记录的隐私和复制结果的准确性方面发挥着作用。DP 被认为是隐私保护数据发布的最新技术,但在临床试验数据共享中并未得到充分利用。本研究旨在确定共享临床试验数据的 ε 值。

材料与方法

我们分析了两个隐私预算 ε 值范围为 0.01 至 10 的临床试验数据集。较小的 ε 值意味着需要添加更多的随机噪声,从而获得更好的隐私保护效果。对原始临床试验数据集和 DP 估计量的经验分布之间的比率、优势比、均值和均值差异进行了比较。

结果

当 ε >1 时,DP 率与原始的 6.5%率非常接近。当 ε≥3 时,DP 优势比与原始的 0.689 优势比非常吻合。当 ε≥1 时,DP 均值与原始均值 164.64 非常接近。当 ε 增加到 5 时,DP 均值的最小值和最大值都趋近于原始均值。

讨论

目前尚无关于如何选择隐私预算 ε 的共识。DP 的定义并未指定所需的隐私级别,也没有确定 ε 的既定公式。

结论

我们的研究结果表明,DP 在共享临床试验数据方面具有广阔的应用前景。

相似文献

1
A data-driven approach to choosing privacy parameters for clinical trial data sharing under differential privacy.一种数据驱动的方法,用于在差分隐私下为临床试验数据共享选择隐私参数。
J Am Med Inform Assoc. 2024 Apr 19;31(5):1135-1143. doi: 10.1093/jamia/ocae038.
2
Does Differentially Private Synthetic Data Lead to Synthetic Discoveries?差分隐私合成数据是否会导致合成发现?
Methods Inf Med. 2024 May;63(1-02):35-51. doi: 10.1055/a-2385-1355. Epub 2024 Aug 13.
3
Federated learning with differential privacy for breast cancer diagnosis enabling secure data sharing and model integrity.用于乳腺癌诊断的具有差分隐私的联邦学习,实现安全的数据共享和模型完整性。
Sci Rep. 2025 Apr 16;15(1):13061. doi: 10.1038/s41598-025-95858-2.
4
Local Differential Privacy in the Medical Domain to Protect Sensitive Information: Algorithm Development and Real-World Validation.医疗领域中用于保护敏感信息的局部差分隐私:算法开发与实际验证
JMIR Med Inform. 2021 Nov 8;9(11):e26914. doi: 10.2196/26914.
5
The project data sphere initiative: accelerating cancer research by sharing data.项目数据领域计划:通过数据共享加速癌症研究
Oncologist. 2015 May;20(5):464-e20. doi: 10.1634/theoncologist.2014-0431. Epub 2015 Apr 15.
6
Federated learning with differential privacy via fast Fourier transform for tighter-efficient combining.通过快速傅里叶变换实现具有差分隐私的联邦学习,以进行更紧密高效的合并。
Sci Rep. 2024 Nov 5;14(1):26770. doi: 10.1038/s41598-024-77428-0.
7
Protecting patient privacy when sharing patient-level data from clinical trials.在共享临床试验中患者层面的数据时保护患者隐私。
BMC Med Res Methodol. 2016 Jul 8;16 Suppl 1(Suppl 1):77. doi: 10.1186/s12874-016-0169-4.
8
Findings from 2017 on Consumer Health Informatics and Education: Health Data Access and Sharing.2017年消费者健康信息学与教育研究结果:健康数据的获取与共享
Yearb Med Inform. 2018 Aug;27(1):163-169. doi: 10.1055/s-0038-1641218. Epub 2018 Aug 29.
9
Privacy-Preserving Generative Deep Neural Networks Support Clinical Data Sharing.隐私保护生成式深度神经网络支持临床数据共享。
Circ Cardiovasc Qual Outcomes. 2019 Jul;12(7):e005122. doi: 10.1161/CIRCOUTCOMES.118.005122. Epub 2019 Jul 9.
10
Sparsified federated learning with differential privacy for intrusion detection in VANETs based on Fisher Information Matrix.基于 Fisher 信息矩阵的 VANET 入侵检测的稀疏联邦学习与差分隐私。
PLoS One. 2024 Apr 17;19(4):e0301897. doi: 10.1371/journal.pone.0301897. eCollection 2024.

引用本文的文献

1
Applications and challenges of biomarker-based predictive models in proactive health management.基于生物标志物的预测模型在主动健康管理中的应用与挑战
Front Public Health. 2025 Aug 18;13:1633487. doi: 10.3389/fpubh.2025.1633487. eCollection 2025.
2
An empirical assessment of differential privacy in real-world observational data: a case-control study of asthma exacerbation in UK Biobank linked with electronic health records.现实世界观察数据中差分隐私的实证评估:英国生物银行与电子健康记录关联的哮喘加重病例对照研究。
J Am Med Inform Assoc. 2025 Aug 1;32(8):1328-1339. doi: 10.1093/jamia/ocaf090.

本文引用的文献

1
Making data sharing the norm in medical research.使数据共享成为医学研究中的常态。
BMJ. 2023 Jul 11;382:1434. doi: 10.1136/bmj.p1434.
2
Some examples of privacy-preserving sharing of COVID-19 pandemic data with statistical utility evaluation.一些具有统计效用评估的 COVID-19 大流行数据隐私保护共享的例子。
BMC Med Res Methodol. 2023 May 19;23(1):120. doi: 10.1186/s12874-023-01927-3.
3
Data sharing and community-engaged research.数据共享和社区参与式研究。
Science. 2022 Oct 14;378(6616):141-143. doi: 10.1126/science.abq6851. Epub 2022 Oct 13.
4
Many researchers say they'll share data - but don't.许多研究人员表示他们会分享数据,但实际上却没有这么做。
Nature. 2022 Jun;606(7916):853. doi: 10.1038/d41586-022-01692-1.
5
Many researchers were not compliant with their published data sharing statement: a mixed-methods study.许多研究人员未遵守其公布的数据共享声明:一项混合方法研究。
J Clin Epidemiol. 2022 Oct;150:33-41. doi: 10.1016/j.jclinepi.2022.05.019. Epub 2022 May 30.
6
A systematic review of homomorphic encryption and its contributions in healthcare industry.同态加密及其在医疗行业贡献的系统综述。
Complex Intell Systems. 2022 May 3:1-28. doi: 10.1007/s40747-022-00756-z.
7
Differential privacy in health research: A scoping review.健康研究中的差分隐私:范围综述。
J Am Med Inform Assoc. 2021 Sep 18;28(10):2269-2276. doi: 10.1093/jamia/ocab135.
8
Medical imaging deep learning with differential privacy.医学影像深度学习中的差分隐私。
Sci Rep. 2021 Jun 29;11(1):13524. doi: 10.1038/s41598-021-93030-0.
9
How differential privacy will affect our understanding of health disparities in the United States.差分隐私将如何影响我们对美国健康差异的理解。
Proc Natl Acad Sci U S A. 2020 Jun 16;117(24):13405-13412. doi: 10.1073/pnas.2003714117. Epub 2020 May 28.
10
Time for NIH to lead on data sharing.美国国立卫生研究院是时候引领数据共享了。
Science. 2020 Mar 20;367(6484):1308-1309. doi: 10.1126/science.aba4456.