使用安全多方计算在大型数据库中进行隐私保护的记录链接。

Privacy-preserving record linkage in large databases using secure multiparty computation.

机构信息

Cybernetica AS, Ülikooli 2, Tartu, 51003, Estonia.

STACC, Ülikooli 2, Tartu, 51003, Estonia.

出版信息

BMC Med Genomics. 2018 Oct 11;11(Suppl 4):84. doi: 10.1186/s12920-018-0400-8.

DOI:10.1186/s12920-018-0400-8

PMID:30309353

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6180364/

Abstract

BACKGROUND

Practical applications for data analysis may require combining multiple databases belonging to different owners, such as health centers. The analysis should be performed without violating privacy of neither the centers themselves, nor the patients whose records these centers store. To avoid biased analysis results, it may be important to remove duplicate records among the centers, so that each patient's data would be taken into account only once. This task is very closely related to privacy-preserving record linkage.

METHODS

This paper presents a solution to privacy-preserving deduplication among records of several databases using secure multiparty computation. It is build upon one of the fastest practical secure multiparty computation platforms, called Sharemind.

RESULTS

The tests on ca 10 million records of simulated databases with 1000 health centers of 10000 records each show that the computation is feasible in practice. The expected running time of the experiment is ca. 30 min for computing servers connected over 100 Mbit/s WAN, the expected error of the results is 2, and no errors have been detected for the particular test set that we used for our benchmarks.

CONCLUSIONS

The solution is ready for practical use. It has well-defined security properties, implied by the properties of Sharemind platform. The solution assumes that exact matching of records is required, and a possible future research would be extending it to approximate matching.

摘要

背景

数据分析的实际应用可能需要结合属于不同所有者的多个数据库，例如健康中心。分析不应侵犯中心本身或存储这些中心记录的患者的隐私。为避免分析结果出现偏差，可能重要的是要删除中心之间的重复记录，以便仅考虑每个患者的数据一次。此任务与保护隐私的记录链接非常密切。

方法

本文提出了一种使用安全多方计算在多个数据库的记录之间进行隐私保护去重的解决方案。它建立在最快的实用安全多方计算平台之一Sharemind 之上。

结果

对具有 1000 个记录的 1000 个健康中心的模拟数据库中的约 1000 万条记录进行的测试表明，该计算在实践中是可行的。对于通过 100 Mbit/s WAN 连接的计算服务器，预计的实验运行时间约为 30 分钟，结果的预期误差为 2，并且对于我们用于基准测试的特定测试集未检测到任何错误。

结论

该解决方案已准备好实际使用。它具有由 Sharemind 平台的属性隐含的明确定义的安全属性。该解决方案假设需要精确匹配记录，并且未来的一项研究可能是将其扩展到近似匹配。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/25cc/6180364/5e7b66341419/12920_2018_400_Fig1_HTML.jpg

相似文献

Privacy-preserving record linkage in large databases using secure multiparty computation.使用安全多方计算在大型数据库中进行隐私保护的记录链接。

BMC Med Genomics. 2018 Oct 11;11(Suppl 4):84. doi: 10.1186/s12920-018-0400-8.

Secure and scalable deduplication of horizontally partitioned health data for privacy-preserving distributed statistical computation.用于隐私保护分布式统计计算的水平分区健康数据的安全且可扩展的重复数据删除

BMC Med Inform Decis Mak. 2017 Jan 3;17(1):1. doi: 10.1186/s12911-016-0389-x.

Mainzelliste SecureEpiLinker (MainSEL): privacy-preserving record linkage using secure multi-party computation. Mainzelliste SecureEpiLinker (MainSEL)：使用安全多方计算进行隐私保护的记录链接。

Bioinformatics. 2022 Mar 4;38(6):1657-1668. doi: 10.1093/bioinformatics/btaa764.

A Privacy-Preserving Log-Rank Test for the Kaplan-Meier Estimator With Secure Multiparty Computation: Algorithm Development and Validation.一种基于安全多方计算的Kaplan-Meier估计器的隐私保护对数秩检验：算法开发与验证

JMIR Med Inform. 2021 Jan 18;9(1):e22158. doi: 10.2196/22158.

Designing an algorithm to preserve privacy for medical record linkage with error-prone data.设计一种算法，在存在错误数据的情况下保护医疗记录链接的隐私。

JMIR Med Inform. 2014 Jan 20;2(1):e2. doi: 10.2196/medinform.3090.

A compressive sensing based secure watermark detection and privacy preserving storage framework.基于压缩感知的安全水印检测和隐私保护存储框架。

IEEE Trans Image Process. 2014 Mar;23(3):1317-28. doi: 10.1109/TIP.2014.2298980.

Two-cloud-servers-assisted secure outsourcing multiparty computation.双云服务器辅助的安全外包多方计算

ScientificWorldJournal. 2014;2014:413265. doi: 10.1155/2014/413265. Epub 2014 May 28.

Privacy-Preserving Integration of Medical Data : A Practical Multiparty Private Set Intersection.医疗数据的隐私保护集成：一种实用的多方私有集交集方法

J Med Syst. 2017 Mar;41(3):37. doi: 10.1007/s10916-016-0657-4. Epub 2017 Jan 16.

High performance logistic regression for privacy-preserving genome analysis.用于隐私保护基因组分析的高性能逻辑回归。

BMC Med Genomics. 2021 Jan 20;14(1):23. doi: 10.1186/s12920-020-00869-9.

Obfuscatable multi-recipient re-encryption for secure privacy-preserving personal health record services.用于安全隐私保护个人健康记录服务的可混淆多接收者重新加密

Technol Health Care. 2015;23 Suppl 1:S139-45. doi: 10.3233/thc-150946.

引用本文的文献

EasySMPC: a simple but powerful no-code tool for practical secure multiparty computation.EasySMPC：一个简单而强大的实用安全多方计算无代码工具。

BMC Bioinformatics. 2022 Dec 9;23(1):531. doi: 10.1186/s12859-022-05044-8.

The evolving privacy and security concerns for genomic data analysis and sharing as observed from the iDASH competition.从 iDASH 竞赛中观察到的基因组数据分析和共享的不断发展的隐私和安全问题。

J Am Med Inform Assoc. 2022 Nov 14;29(12):2182-2190. doi: 10.1093/jamia/ocac165.

Privacy-preserving data sharing infrastructures for medical research: systematization and comparison.用于医学研究的隐私保护数据共享基础架构：系统梳理与比较。

BMC Med Inform Decis Mak. 2021 Aug 12;21(1):242. doi: 10.1186/s12911-021-01602-x.

JMIR Med Inform. 2021 Jan 18;9(1):e22158. doi: 10.2196/22158.

Bioinformatics. 2022 Mar 4;38(6):1657-1668. doi: 10.1093/bioinformatics/btaa764.

Review of Clinical Research Informatics.临床研究信息学述评。

Yearb Med Inform. 2020 Aug;29(1):193-202. doi: 10.1055/s-0040-1701988. Epub 2020 Aug 21.

Fold-stratified cross-validation for unbiased and privacy-preserving federated learning.无偏隐私保护联邦学习的折叠分层交叉验证。

J Am Med Inform Assoc. 2020 Aug 1;27(8):1244-1251. doi: 10.1093/jamia/ocaa096.

iDASH secure genome analysis competition 2017.2017年iDASH安全基因组分析竞赛

BMC Med Genomics. 2018 Oct 11;11(Suppl 4):85. doi: 10.1186/s12920-018-0396-0.

本文引用的文献

Piloting the European Unified Patient Identity Management (EUPID) Concept to Facilitate Secondary Use of Neuroblastoma Data from Clinical Trials and Biobanking.试点欧洲统一患者身份管理（EUPID）概念，以促进神经母细胞瘤临床试验和生物样本库数据的二次利用。

Stud Health Technol Inform. 2016;223:31-8.

Privacy-preserving matching of similar patients.相似患者的隐私保护匹配

J Biomed Inform. 2016 Feb;59:285-98. doi: 10.1016/j.jbi.2015.12.004. Epub 2015 Dec 17.

Design and implementation of a privacy preserving electronic health record linkage tool in Chicago.芝加哥一种隐私保护电子健康记录链接工具的设计与实现

J Am Med Inform Assoc. 2015 Sep;22(5):1072-80. doi: 10.1093/jamia/ocv038. Epub 2015 Jun 23.

A practical approach to achieve private medical record linkage in light of public resources.基于公共资源实现私人医疗记录链接的实用方法。

J Am Med Inform Assoc. 2013 Mar-Apr;20(2):285-92. doi: 10.1136/amiajnl-2012-000917. Epub 2012 Jul 30.

Privacy-preserving record linkage using Bloom filters.使用布隆过滤器的隐私保护记录链接

BMC Med Inform Decis Mak. 2009 Aug 25;9:41. doi: 10.1186/1472-6947-9-41.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用安全多方计算在大型数据库中进行隐私保护的记录链接。

Privacy-preserving record linkage in large databases using secure multiparty computation.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献