Quantifying the Correctness, Computational Complexity, and Security of Privacy-Preserving String Comparators for Record Linkage.

Authors

Durham Elizabeth, Xue Yuan, Kantarcioglu Murat, Malin Bradley

Affiliation

Department of Biomedical Informatics, Vanderbilt University, 2525 West End Avenue, Nashville, TN 37203, USA.

Publication

Inf Fusion. 2012 Oct 1;13(4):245-259. doi: 10.1016/j.inffus.2011.04.004.

DOI:10.1016/j.inffus.2011.04.004
PMID:22904698
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3418825/
Abstract

Record linkage is the task of identifying records from disparate data sources that refer to the same entity. It is an integral component of data processing in distributed settings, where the integration of information from multiple sources can prevent duplication and enrich overall data quality, thus enabling more detailed and correct analysis. Privacy-preserving record linkage (PPRL) is a variant of the task in which data owners wish to perform linkage without revealing identifiers associated with the records. This task is desirable in various domains, including healthcare, where it may not be possible to reveal patient identity due to confidentiality requirements, and in business, where it could be disadvantageous to divulge customers' identities. To perform PPRL, it is necessary to apply string comparators that function in the privacy-preserving space. A number of privacy-preserving string comparators (PPSCs) have been proposed, but little research has compared them in the context of a real record linkage application. This paper performs a principled and comprehensive evaluation of six PPSCs in terms of three key properties: 1) correctness of record linkage predictions, 2) computational complexity, and 3) security. We utilize a real publicly-available dataset, derived from the North Carolina voter registration database, to evaluate the tradeoffs between the aforementioned properties. Among our results, we find that PPSCs that partition, encode, and compare strings yield highly accurate record linkage results. However, as a tradeoff, we observe that such PPSCs are less secure than those that map and compare strings in a reduced dimensional space.
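
The abstract does not name the six comparators, so the following is only an illustrative sketch of the "partition, encode, and compare" family, assuming a Bloom-filter style encoding of character bigrams compared with the Dice coefficient. The function names, the filter length `m`, the number of hash functions `k`, and the shared secret `key` are all hypothetical choices made for this example, not parameters taken from the paper.

```python
import hashlib
import hmac


def bigrams(s: str) -> set[str]:
    """Partition: split a padded, lowercased string into character bigrams."""
    padded = f"_{s.lower()}_"
    return {padded[i:i + 2] for i in range(len(padded) - 1)}


def bloom_encode(s: str, key: bytes, m: int = 1000, k: int = 20) -> set[int]:
    """Encode: map each bigram to k bit positions in a filter of length m.

    The filter is represented as the set of positions that are set to 1.
    Positions come from keyed double hashing (two HMACs combined), an
    assumption made here for brevity.
    """
    positions: set[int] = set()
    for gram in bigrams(s):
        data = gram.encode("utf-8")
        h1 = int.from_bytes(hmac.new(key, data, hashlib.sha256).digest(), "big")
        h2 = int.from_bytes(hmac.new(key, data, hashlib.sha512).digest(), "big")
        for i in range(k):
            positions.add((h1 + i * h2) % m)
    return positions


def dice(a: set[int], b: set[int]) -> float:
    """Compare: Dice coefficient of two encoded filters, in [0, 1]."""
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


# Both data owners must share the same key (hypothetical value here).
key = b"shared-secret"
same = dice(bloom_encode("smith", key), bloom_encode("smith", key))
close = dice(bloom_encode("smith", key), bloom_encode("smyth", key))
far = dice(bloom_encode("smith", key), bloom_encode("jones", key))
```

In this sketch the data owners would exchange only the encoded filters, never the names. Similar strings share most of their bigrams and therefore most of their set bits, which is what lets approximate matching survive the encoding; that same preserved structure is a plausible reason such encodings reveal more than comparators that work in a reduced dimensional space, in line with the tradeoff the abstract reports.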



