• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
The Impact of Name Transformation on Match Rates Within a Large Consumer Database.名称转换对大型消费者数据库中匹配率的影响。
AMIA Annu Symp Proc. 2023 Apr 29;2022:692-699. eCollection 2022.
2
Evaluating privacy-preserving record linkage using cryptographic long-term keys and multibit trees on large medical datasets.在大型医学数据集上使用加密长期密钥和多位树评估隐私保护记录链接。
BMC Med Inform Decis Mak. 2017 Jun 8;17(1):83. doi: 10.1186/s12911-017-0478-5.
3
Optimization of the Mainzelliste software for fast privacy-preserving record linkage.优化 Mainzelliste 软件以实现快速的隐私保护记录链接。
J Transl Med. 2021 Jan 15;19(1):33. doi: 10.1186/s12967-020-02678-1.
4
Estimating parameters for probabilistic linkage of privacy-preserved datasets.估算隐私保护数据集概率关联的参数。
BMC Med Res Methodol. 2017 Jul 10;17(1):95. doi: 10.1186/s12874-017-0370-0.
5
Validating a novel deterministic privacy-preserving record linkage between administrative & clinical data: applications in stroke research.验证一种新颖的行政与临床数据确定性隐私保护记录链接方法:在中风研究中的应用。
Int J Popul Data Sci. 2022 Nov 22;7(4):1755. doi: 10.23889/ijpds.v7i4.1755. eCollection 2022.
6
Mainzelliste SecureEpiLinker (MainSEL): privacy-preserving record linkage using secure multi-party computation. Mainzelliste SecureEpiLinker (MainSEL):使用安全多方计算进行隐私保护的记录链接。
Bioinformatics. 2022 Mar 4;38(6):1657-1668. doi: 10.1093/bioinformatics/btaa764.
7
De-identified Bayesian personal identity matching for privacy-preserving record linkage despite errors: development and validation.去标识化贝叶斯个人身份匹配用于隐私保护记录链接,即使存在错误:开发和验证。
BMC Med Inform Decis Mak. 2023 May 5;23(1):85. doi: 10.1186/s12911-023-02176-6.
8
Privacy preserving probabilistic record linkage (P3RL): a novel method for linking existing health-related data and maintaining participant confidentiality.隐私保护概率性记录链接(P3RL):一种链接现有健康相关数据并维护参与者隐私的新方法。
BMC Med Res Methodol. 2015 May 30;15:46. doi: 10.1186/s12874-015-0038-6.
9
Using machine learning to link electronic health records in cancer registries: On the tradeoff between linkage quality and manual effort.使用机器学习链接癌症登记处的电子健康记录:在链接质量和人工努力之间的权衡。
Int J Med Inform. 2024 May;185:105387. doi: 10.1016/j.ijmedinf.2024.105387. Epub 2024 Feb 28.
10
The Data-Adaptive Fellegi-Sunter Model for Probabilistic Record Linkage: Algorithm Development and Validation for Incorporating Missing Data and Field Selection.数据自适应 Fellegi-Sunter 模型在概率记录链接中的应用:纳入缺失数据和字段选择的算法开发和验证。
J Med Internet Res. 2022 Sep 29;24(9):e33775. doi: 10.2196/33775.

引用本文的文献

1
Accuracy of privacy preserving record linkage for real world data in the United States: a systemic review.美国真实世界数据中隐私保护记录链接的准确性:一项系统评价。
JAMIA Open. 2025 Jan 22;8(1):ooaf002. doi: 10.1093/jamiaopen/ooaf002. eCollection 2025 Feb.

本文引用的文献

1
Joining Health Care and Homeless Data Systems Using Privacy-Preserving Record-Linkage Software.使用隐私保护记录链接软件连接医疗保健和无家可归者数据系统。
Am J Public Health. 2021 Aug;111(8):1400-1403. doi: 10.2105/AJPH.2021.306304.
2
Implementing a hash-based privacy-preserving record linkage tool in the OneFlorida clinical research network.在佛罗里达临床研究网络中实施基于哈希的隐私保护记录链接工具。
JAMIA Open. 2019 Sep 27;2(4):562-569. doi: 10.1093/jamiaopen/ooz050. eCollection 2019 Dec.
3
Health Care Utilization Among Homeless Veterans in Chicago.芝加哥无家可归退伍军人的医疗保健利用情况。
Mil Med. 2020 Mar 2;185(3-4):e335-e339. doi: 10.1093/milmed/usz264.
4
Data linkages between patient-powered research networks and health plans: a foundation for collaborative research.患者主导的研究网络与健康计划之间的数据链接:合作研究的基础。
J Am Med Inform Assoc. 2019 Jul 1;26(7):594-602. doi: 10.1093/jamia/ocz012.
5
Evaluating the effect of data standardization and validation on patient matching accuracy.评估数据标准化和验证对患者匹配准确性的影响。
J Am Med Inform Assoc. 2019 May 1;26(5):447-456. doi: 10.1093/jamia/ocy191.
6
Validity of Cardiovascular Data From Electronic Sources: The Multi-Ethnic Study of Atherosclerosis and HealthLNK.电子来源心血管数据的有效性:动脉粥样硬化与健康LNK多族裔研究
Circulation. 2017 Sep 26;136(13):1207-1216. doi: 10.1161/CIRCULATIONAHA.117.027436. Epub 2017 Jul 7.
7
The Building Blocks of Interoperability. A Multisite Analysis of Patient Demographic Attributes Available for Matching.互操作性的构建要素。对可用于匹配的患者人口统计学属性的多中心分析。
Appl Clin Inform. 2017 Apr 5;8(2):322-336. doi: 10.4338/ACI-2016-11-RA-0196.
8
Disease Outcomes and Care Fragmentation Among Patients With Systemic Lupus Erythematosus.系统性红斑狼疮患者的疾病转归与医疗碎片化
Arthritis Care Res (Hoboken). 2017 Sep;69(9):1369-1376. doi: 10.1002/acr.23161. Epub 2017 Aug 8.
9
An Evaluation of Recurrent Diabetic Ketoacidosis, Fragmentation of Care, and Mortality Across Chicago, Illinois.伊利诺伊州芝加哥市对复发性糖尿病酮症酸中毒、医疗碎片化和死亡率的评估。
Diabetes Care. 2016 Oct;39(10):1671-6. doi: 10.2337/dc16-0668. Epub 2016 Jul 15.
10
Design and implementation of a privacy preserving electronic health record linkage tool in Chicago.芝加哥一种隐私保护电子健康记录链接工具的设计与实现
J Am Med Inform Assoc. 2015 Sep;22(5):1072-80. doi: 10.1093/jamia/ocv038. Epub 2015 Jun 23.

名称转换对大型消费者数据库中匹配率的影响。

The Impact of Name Transformation on Match Rates Within a Large Consumer Database.

机构信息

Datavant, San Francisco, CA.

Northwestern University, Chicago, IL.

出版信息

AMIA Annu Symp Proc. 2023 Apr 29;2022:692-699. eCollection 2022.

PMID:37128403
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10148307/
Abstract

Accurate record linkage depends on the availability and quality of features such as first name and last name. Privacy preserving record linkage methods using tokenization is sensitive to perturbations in the patient features used as inputs. In this study we evaluated the impact of name transformations on the accuracy of patient matching using a large commercial dataset. We used a set of 68 million records representing 59 million unique individuals, and implemented and evaluated eight name transformation strategies, and generated precision, recall and F1 scores. Transforming names to include the most common nicknames resulted in a significant gain in recall while maintaining precision, and generated the highest F1 score compared with no name transformation (0.905 vs 0.807). Strategies tailored to transforming patient features can improve the precision and recall of patient matching, and make it possible to create high quality, linked datasets for research purposes.

摘要

准确的记录链接依赖于特征(如名字和姓氏)的可用性和质量。使用标记化的隐私保护记录链接方法对作为输入的患者特征的干扰很敏感。在这项研究中,我们使用大型商业数据集评估了名称转换对患者匹配准确性的影响。我们使用了一组代表 5900 万个唯一个体的 6800 万条记录,并实现和评估了八种名称转换策略,并生成了精度、召回率和 F1 分数。将姓名转换为包含最常见的昵称可以显著提高召回率,同时保持精度,并与不进行名称转换相比生成最高的 F1 分数(0.905 比 0.807)。针对转换患者特征的策略可以提高患者匹配的精度和召回率,并有可能为研究目的创建高质量的、链接的数据集。