• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用确定性和概率性方法相结合的混合记录链接方法。

A hybrid approach to record linkage using a combination of deterministic and probabilistic methodology.

机构信息

Department of Pediatrics, School of Medicine, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA.

Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA.

出版信息

J Am Med Inform Assoc. 2020 Apr 1;27(4):505-513. doi: 10.1093/jamia/ocz232.

DOI:10.1093/jamia/ocz232
PMID:32049329
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7647290/
Abstract

OBJECTIVE

The disjointed healthcare system and the nonexistence of a universal patient identifier across systems necessitates accurate record linkage (RL). We aim to describe the implementation and evaluation of a hybrid record linkage method in a statewide surveillance system for congenital heart disease.

MATERIALS AND METHODS

Clear-text personally identifiable information on individuals in the Colorado Congenital Heart Disease surveillance system was obtained from 5 electronic health record and medical claims data sources. Two deterministic methods and 1 probabilistic RL method using first name, last name, social security number, date of birth, and house number were initially implemented independently and then sequentially in a hybrid approach to assess RL performance.

RESULTS

16 480 nonunique individuals with congenital heart disease were ascertained. Deterministic linkage methods, when performed independently, yielded 4505 linked pairs (consisting of 2 records linked together within or across data sources). Probabilistic RL, using 3 initial characters of last name and gender for blocking, yielded 6294 linked pairs when executed independently. Using a hybrid linkage routine resulted in 6451 linkages and an additional 18%-24% correct linked pairs as compared to the independent methods. A hybrid linkage routine resulted in higher recall and F-measure scores compared to probabilistic and deterministic methods performed independently.

DISCUSSION

The hybrid approach resulted in increased linkage accuracy and identified pairs of linked record that would have otherwise been missed when using any independent linkage technique.

CONCLUSION

When performing RL within and across disparate data sources, the hybrid RL routine outperformed independent deterministic and probabilistic methods.

摘要

目的

不连贯的医疗保健系统和系统之间缺乏通用的患者标识符,这就需要准确的记录链接(RL)。我们旨在描述在全州先天性心脏病监测系统中实施和评估混合记录链接方法。

材料与方法

从 5 个电子健康记录和医疗索赔数据来源中获取科罗拉多先天性心脏病监测系统中个人的明文身份信息。最初独立实施了 2 种确定性方法和 1 种基于名、姓、社会安全号码、出生日期和门牌号码的概率 RL 方法,然后在混合方法中按顺序进行,以评估 RL 性能。

结果

确定了 16480 名患有先天性心脏病的非独特个体。当独立执行确定性链接方法时,产生了 4505 对链接(由在数据源内或跨数据源链接在一起的 2 条记录组成)。概率 RL 使用姓氏和性别的前 3 个字符进行阻塞,独立执行时产生了 6294 对链接。与独立方法相比,使用混合链接例程可获得 6451 个链接和额外的 18%-24%正确链接对。与独立执行的概率和确定性方法相比,混合链接例程的召回率和 F 度量得分更高。

讨论

混合方法提高了链接准确性,并识别出了使用任何独立链接技术可能会错过的链接记录对。

结论

在跨不同数据源执行 RL 时,混合 RL 例程优于独立的确定性和概率方法。

相似文献

1
A hybrid approach to record linkage using a combination of deterministic and probabilistic methodology.一种使用确定性和概率性方法相结合的混合记录链接方法。
J Am Med Inform Assoc. 2020 Apr 1;27(4):505-513. doi: 10.1093/jamia/ocz232.
2
Linkability measures to assess the data characteristics for record linkage.链接性度量用于评估记录链接的数据特征。
J Am Med Inform Assoc. 2024 Nov 1;31(11):2651-2659. doi: 10.1093/jamia/ocae248.
3
Linking Electronic Health Record and Trauma Registry Data: Assessing the Value of Probabilistic Linkage.连接电子健康记录与创伤登记数据:评估概率性连接的价值。
Methods Inf Med. 2018 Nov;57(5-06):261-269. doi: 10.1055/s-0039-1681087. Epub 2019 Mar 15.
4
Population-level surveillance of congenital heart defects among adolescents and adults in Colorado: Implications of record linkage.科罗拉多州青少年和成年人先天性心脏病的人群水平监测:记录链接的意义。
Am Heart J. 2020 Aug;226:75-84. doi: 10.1016/j.ahj.2020.04.008. Epub 2020 Apr 19.
5
Linking mothers and infants within electronic health records: a comparison of deterministic and probabilistic algorithms.在电子健康记录中关联母婴:确定性算法与概率性算法的比较
Pharmacoepidemiol Drug Saf. 2015 Jan;24(1):45-51. doi: 10.1002/pds.3728. Epub 2014 Nov 18.
6
Comparing record linkage software programs and algorithms using real-world data.使用真实世界的数据比较记录链接软件程序和算法。
PLoS One. 2019 Sep 24;14(9):e0221459. doi: 10.1371/journal.pone.0221459. eCollection 2019.
7
Analysis of identifier performance using a deterministic linkage algorithm.使用确定性链接算法分析标识符性能。
Proc AMIA Symp. 2002:305-9.
8
Linking surveillance and clinical data for evaluating trends in bloodstream infection rates in neonatal units in England.将监测数据与临床数据相链接,以评估英格兰新生儿病房血流感染率的趋势。
PLoS One. 2019 Dec 12;14(12):e0226040. doi: 10.1371/journal.pone.0226040. eCollection 2019.
9
Comparing Methods for Record Linkage for Public Health Action: Matching Algorithm Validation Study.比较公共卫生行动记录链接的方法:匹配算法验证研究。
JMIR Public Health Surveill. 2020 Apr 30;6(2):e15917. doi: 10.2196/15917.
10
The promise of record linkage for assessing the uptake of health services in resource constrained settings: a pilot study from South Africa.在资源受限环境中利用记录链接评估卫生服务利用情况的前景:来自南非的一项试点研究
BMC Med Res Methodol. 2014 May 24;14:71. doi: 10.1186/1471-2288-14-71.

引用本文的文献

1
The missing link: Electronic health record linkage across species offers opportunities for improving One Health.缺失的环节:跨物种的电子健康记录链接为改善“同一健康”提供了机遇。
medRxiv. 2025 Mar 26:2025.03.25.25324490. doi: 10.1101/2025.03.25.25324490.
2
Data linkage multiplies research insights across diverse healthcare sectors.数据关联可成倍增加跨不同医疗保健领域的研究见解。
Commun Med (Lond). 2025 Mar 4;5(1):58. doi: 10.1038/s43856-025-00769-y.
3
Linkability measures to assess the data characteristics for record linkage.链接性度量用于评估记录链接的数据特征。
J Am Med Inform Assoc. 2024 Nov 1;31(11):2651-2659. doi: 10.1093/jamia/ocae248.
4
Evaluating Linkage Quality of Population-Based Administrative Data for Health Service Research.评价基于人群的行政健康服务研究数据的关联质量。
J Korean Med Sci. 2024 Apr 15;39(14):e127. doi: 10.3346/jkms.2024.39.e127.
5
A broadly applicable approach to enrich electronic-health-record cohorts by identifying patients with complete data: a multisite evaluation.一种通过识别具有完整数据的患者来丰富电子健康记录队列的广泛适用方法:多站点评估。
J Am Med Inform Assoc. 2023 Nov 17;30(12):1985-1994. doi: 10.1093/jamia/ocad166.
6
Underreporting of unfavorable outcomes of congenital syphilis on the Notifiable Health Conditions Information System in the state of São Paulo, Brazil, 2007-2018.巴西圣保罗州 2007-2018 年传染病报告系统中先天梅毒不良结局漏报情况。
Epidemiol Serv Saude. 2023 Jul 14;32(2):e2022664. doi: 10.1590/S2237-96222023000200007. eCollection 2023.
7
A cohort of patients in New York State with an alcohol use disorder and subsequent treatment information - A merging of two administrative data sources.纽约州患有酒精使用障碍及后续治疗信息的患者队列 - 两个行政数据源的合并。
J Biomed Inform. 2023 Aug;144:104443. doi: 10.1016/j.jbi.2023.104443. Epub 2023 Jul 16.
8
Engaging Patients and Other Stakeholders in "Designing for Dissemination" of Record Linkage Methods and Tools.让患者和其他利益相关者参与到“为传播设计”记录链接方法和工具中来。
Appl Clin Inform. 2023 Aug;14(4):670-683. doi: 10.1055/a-2105-6505. Epub 2023 Jun 5.
9
Virtual patient identifier (vPID): Improving patient traceability using anonymized identifiers in Japanese healthcare insurance claims database.虚拟患者标识符(vPID):在日本医疗保险理赔数据库中使用匿名标识符提高患者可追溯性。
Heliyon. 2023 May 12;9(5):e16209. doi: 10.1016/j.heliyon.2023.e16209. eCollection 2023 May.
10
Assessing the impact of privacy-preserving record linkage on record overlap and patient demographic and clinical characteristics in PCORnet®, the National Patient-Centered Clinical Research Network.评估在国家以患者为中心的临床研究网络PCORnet®中,隐私保护记录链接对记录重叠以及患者人口统计学和临床特征的影响。
J Am Med Inform Assoc. 2023 Feb 16;30(3):447-455. doi: 10.1093/jamia/ocac229.

本文引用的文献

1
Evaluating the effect of data standardization and validation on patient matching accuracy.评估数据标准化和验证对患者匹配准确性的影响。
J Am Med Inform Assoc. 2019 May 1;26(5):447-456. doi: 10.1093/jamia/ocy191.
2
Evaluating privacy-preserving record linkage using cryptographic long-term keys and multibit trees on large medical datasets.在大型医学数据集上使用加密长期密钥和多位树评估隐私保护记录链接。
BMC Med Inform Decis Mak. 2017 Jun 8;17(1):83. doi: 10.1186/s12911-017-0478-5.
3
Using Electronic Health Records for Population Health Research: A Review of Methods and Applications.利用电子健康记录进行人群健康研究:方法与应用综述。
Annu Rev Public Health. 2016;37:61-81. doi: 10.1146/annurev-publhealth-032315-021353. Epub 2015 Dec 11.
4
Design and implementation of a privacy preserving electronic health record linkage tool in Chicago.芝加哥一种隐私保护电子健康记录链接工具的设计与实现
J Am Med Inform Assoc. 2015 Sep;22(5):1072-80. doi: 10.1093/jamia/ocv038. Epub 2015 Jun 23.
5
Improving record linkage performance in the presence of missing linkage data.在存在缺失链接数据的情况下提高记录链接性能。
J Biomed Inform. 2014 Dec;52:43-54. doi: 10.1016/j.jbi.2014.01.016. Epub 2014 Feb 10.
6
A benchmark comparison of deterministic and probabilistic methods for defining manual review datasets in duplicate records reconciliation.在重复记录核对中定义人工审核数据集的确定性方法和概率性方法的基准比较。
J Am Med Inform Assoc. 2014 Jan-Feb;21(1):97-104. doi: 10.1136/amiajnl-2013-001744. Epub 2013 May 23.
7
The development of a data-matching algorithm to define the 'case patient'.用于定义“病例患者”的数据匹配算法的开发。
Aust Health Rev. 2013 Feb;37(1):54-9. doi: 10.1071/AH11161.
8
Linking clinical registry data with administrative data using indirect identifiers: implementation and validation in the congenital heart surgery population.使用间接标识符将临床注册数据与管理数据相链接:在先天性心脏病手术人群中的实施和验证。
Am Heart J. 2010 Dec;160(6):1099-104. doi: 10.1016/j.ahj.2010.08.010.
9
Using global unique identifiers to link autism collections.使用全球唯一标识符来链接自闭症数据集。
J Am Med Inform Assoc. 2010 Nov-Dec;17(6):689-95. doi: 10.1136/jamia.2009.002063.
10
Case study of linking dental and medical healthcare records.牙科和医疗保健记录关联的案例研究。
Am J Manag Care. 2010 Feb 1;16(2):e51-6.