• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分布式生物医学数据的隐私保护分析:运用分布式统计学习理论设计高效且安全的多方计算

Privacy-Preserving Analysis of Distributed Biomedical Data: Designing Efficient and Secure Multiparty Computations Using Distributed Statistical Learning Theory.

作者信息

Dankar Fida K, Madathil Nisha, Dankar Samar K, Boughorbel Sabri

机构信息

United Arab Emirates University, Abu Dhabi, United Arab Emirates.

Independent Scientist, Ottawa, ON, Canada.

出版信息

JMIR Med Inform. 2019 Apr 29;7(2):e12702. doi: 10.2196/12702.

DOI:10.2196/12702
PMID:31033449
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6658266/
Abstract

BACKGROUND

Biomedical research often requires large cohorts and necessitates the sharing of biomedical data with researchers around the world, which raises many privacy, ethical, and legal concerns. In the face of these concerns, privacy experts are trying to explore approaches to analyzing the distributed data while protecting its privacy. Many of these approaches are based on secure multiparty computations (SMCs). SMC is an attractive approach allowing multiple parties to collectively carry out calculations on their datasets without having to reveal their own raw data; however, it incurs heavy computation time and requires extensive communication between the involved parties.

OBJECTIVE

This study aimed to develop usable and efficient SMC applications that meet the needs of the potential end-users and to raise general awareness about SMC as a tool that supports data sharing.

METHODS

We have introduced distributed statistical computing (DSC) into the design of secure multiparty protocols, which allows us to conduct computations on each of the parties' sites independently and then combine these computations to form 1 estimator for the collective dataset, thus limiting communication to the final step and reducing complexity. The effectiveness of our privacy-preserving model is demonstrated through a linear regression application.

RESULTS

Our secure linear regression algorithm was tested for accuracy and performance using real and synthetic datasets. The results showed no loss of accuracy (over nonsecure regression) and very good performance (20 min for 100 million records).

CONCLUSIONS

We used DSC to securely calculate a linear regression model over multiple datasets. Our experiments showed very good performance (in terms of the number of records it can handle). We plan to extend our method to other estimators such as logistic regression.

摘要

背景

生物医学研究通常需要大量队列,并需要与全球的研究人员共享生物医学数据,这引发了许多隐私、伦理和法律方面的担忧。面对这些担忧,隐私专家试图探索在保护隐私的同时分析分布式数据的方法。其中许多方法基于安全多方计算(SMC)。SMC是一种有吸引力的方法,它允许多方对其数据集进行集体计算,而无需透露自己的原始数据;然而,它会带来大量的计算时间,并且需要参与方之间进行广泛的通信。

目的

本研究旨在开发满足潜在终端用户需求的可用且高效的SMC应用程序,并提高人们对SMC作为支持数据共享工具的普遍认识。

方法

我们将分布式统计计算(DSC)引入到安全多方协议的设计中,这使我们能够在各方站点上独立进行计算,然后将这些计算结果合并以形成针对集体数据集的1个估计量,从而将通信限制在最后一步并降低复杂性。我们通过线性回归应用展示了隐私保护模型的有效性。

结果

我们的安全线性回归算法使用真实和合成数据集进行了准确性和性能测试。结果表明,(与非安全回归相比)准确性没有损失,并且性能非常好(处理1亿条记录需要20分钟)。

结论

我们使用DSC在多个数据集上安全地计算线性回归模型。我们的实验显示了非常好的性能(就其能够处理的记录数量而言)。我们计划将我们的方法扩展到其他估计量,如逻辑回归。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/695e7d40dd29/medinform_v7i2e12702_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/3dcf4ac3b235/medinform_v7i2e12702_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/0a746d9f67be/medinform_v7i2e12702_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/c44836ac13ee/medinform_v7i2e12702_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/07887d1b3e86/medinform_v7i2e12702_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/9053ed05f82f/medinform_v7i2e12702_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/695e7d40dd29/medinform_v7i2e12702_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/3dcf4ac3b235/medinform_v7i2e12702_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/0a746d9f67be/medinform_v7i2e12702_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/c44836ac13ee/medinform_v7i2e12702_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/07887d1b3e86/medinform_v7i2e12702_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/9053ed05f82f/medinform_v7i2e12702_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f23/6658266/695e7d40dd29/medinform_v7i2e12702_fig6.jpg

相似文献

1
Privacy-Preserving Analysis of Distributed Biomedical Data: Designing Efficient and Secure Multiparty Computations Using Distributed Statistical Learning Theory.分布式生物医学数据的隐私保护分析:运用分布式统计学习理论设计高效且安全的多方计算
JMIR Med Inform. 2019 Apr 29;7(2):e12702. doi: 10.2196/12702.
2
A Privacy-Preserving Log-Rank Test for the Kaplan-Meier Estimator With Secure Multiparty Computation: Algorithm Development and Validation.一种基于安全多方计算的Kaplan-Meier估计器的隐私保护对数秩检验:算法开发与验证
JMIR Med Inform. 2021 Jan 18;9(1):e22158. doi: 10.2196/22158.
3
Multiparty Secure Broad Learning System for Privacy Preserving.用于隐私保护的多方安全广义学习系统
IEEE Trans Cybern. 2023 Oct;53(10):6636-6648. doi: 10.1109/TCYB.2023.3235496. Epub 2023 Sep 15.
4
A Comprehensive Comparison of Multiparty Secure Additions with Differential Privacy.多方安全加法与差分隐私的全面比较
IEEE Trans Dependable Secure Comput. 2017 Sep-Oct;14(5):463-477. doi: 10.1109/TDSC.2015.2484326. Epub 2015 Oct 1.
5
Differentially Private Distributed Online Learning.差分隐私分布式在线学习
IEEE Trans Knowl Data Eng. 2018 Aug;30(8):1440-1453. doi: 10.1109/TKDE.2018.2794384. Epub 2018 Jan 17.
6
Scalable and Privacy-Preserving Federated Principal Component Analysis.可扩展且保护隐私的联邦主成分分析
Proc IEEE Symp Secur Priv. 2023 May;2023:1908-1925. doi: 10.1109/sp46215.2023.10179350. Epub 2023 Jul 21.
7
Privacy-preserving logistic regression with secret sharing.基于秘密共享的隐私保护逻辑回归。
BMC Med Inform Decis Mak. 2022 Apr 2;22(1):89. doi: 10.1186/s12911-022-01811-y.
8
High performance logistic regression for privacy-preserving genome analysis.用于隐私保护基因组分析的高性能逻辑回归。
BMC Med Genomics. 2021 Jan 20;14(1):23. doi: 10.1186/s12920-020-00869-9.
9
EasySMPC: a simple but powerful no-code tool for practical secure multiparty computation.EasySMPC:一个简单而强大的实用安全多方计算无代码工具。
BMC Bioinformatics. 2022 Dec 9;23(1):531. doi: 10.1186/s12859-022-05044-8.
10
Collaborative privacy-preserving analysis of oncological data using multiparty homomorphic encryption.使用多方同态加密技术进行协作式隐私保护的肿瘤学数据分析。
Proc Natl Acad Sci U S A. 2023 Aug 15;120(33):e2304415120. doi: 10.1073/pnas.2304415120. Epub 2023 Aug 7.

引用本文的文献

1
Revolutionizing healthcare data analytics with federated learning: A comprehensive survey of applications, systems, and future directions.利用联邦学习革新医疗数据分析:应用、系统及未来方向的全面综述
Comput Struct Biotechnol J. 2025 Jun 11;28:217-238. doi: 10.1016/j.csbj.2025.06.009. eCollection 2025.
2
EasySMPC: a simple but powerful no-code tool for practical secure multiparty computation.EasySMPC:一个简单而强大的实用安全多方计算无代码工具。
BMC Bioinformatics. 2022 Dec 9;23(1):531. doi: 10.1186/s12859-022-05044-8.
3
New Approach to Privacy-Preserving Clinical Decision Support Systems for HIV Treatment.

本文引用的文献

1
The development of large-scale de-identified biomedical databases in the age of genomics-principles and challenges.基因组时代大规模去识别生物医学数据库的发展:原则与挑战。
Hum Genomics. 2018 Apr 10;12(1):19. doi: 10.1186/s40246-018-0147-5.
2
A secure distributed logistic regression protocol for the detection of rare adverse drug events.一种用于检测罕见药物不良事件的安全分布式逻辑回归协议。
J Am Med Inform Assoc. 2013 May 1;20(3):453-61. doi: 10.1136/amiajnl-2011-000735. Epub 2012 Aug 7.
3
On the linearity of nature or the risk of extending regression lines beyond the observed data.
针对 HIV 治疗的隐私保护临床决策支持系统的新方法。
J Med Syst. 2022 Oct 20;46(12):84. doi: 10.1007/s10916-022-01851-x.
4
A comparison of distributed machine learning methods for the support of "many labs" collaborations in computational modeling of decision making.用于支持决策计算建模中“多实验室”合作的分布式机器学习方法比较
Front Psychol. 2022 Aug 25;13:943198. doi: 10.3389/fpsyg.2022.943198. eCollection 2022.
5
Privacy-Preserving Artificial Intelligence Techniques in Biomedicine.生物医学中的隐私保护人工智能技术。
Methods Inf Med. 2022 Jun;61(S 01):e12-e27. doi: 10.1055/s-0041-1740630. Epub 2022 Jan 21.
6
Reliability of Supervised Machine Learning Using Synthetic Data in Health Care: Model to Preserve Privacy for Data Sharing.医疗保健中使用合成数据的监督式机器学习的可靠性:用于数据共享时保护隐私的模型
JMIR Med Inform. 2020 Jul 20;8(7):e18910. doi: 10.2196/18910.
7
A comprehensive tool for creating and evaluating privacy-preserving biomedical prediction models.用于创建和评估隐私保护的生物医学预测模型的综合工具。
BMC Med Inform Decis Mak. 2020 Feb 11;20(1):29. doi: 10.1186/s12911-020-1041-3.
8
Informed Consent in Biomedical Research.生物医学研究中的知情同意
Comput Struct Biotechnol J. 2019 Mar 25;17:463-474. doi: 10.1016/j.csbj.2019.03.010. eCollection 2019.
Eur J Clin Invest. 2011 Nov;41(11):1259-60. doi: 10.1111/j.1365-2362.2011.02527.x. Epub 2011 Apr 5.
4
Genetic polymorphisms and susceptibility to lung disease.基因多态性与肺部疾病易感性
J Negat Results Biomed. 2006 Apr 11;5:5. doi: 10.1186/1477-5751-5-5.