Analysis of a probabilistic record linkage technique without human review.

Authors

Grannis Shaun J, Overhage J Marc, Hui Siu, McDonald Clement J

Affiliation

Regenstrief Institute and Indiana University School of Medicine, Indianapolis, IN, USA.

Publication

AMIA Annu Symp Proc. 2003;2003:259-63.

PMID:14728174
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1479910/
Abstract

We previously developed a deterministic record linkage algorithm demonstrating sensitivities approaching 90% while maintaining 100% specificity. Substantially better performance has been reported using probabilistic linkage techniques; however, such methods often incorporate human review into the process. To avoid human review, we employed an estimator function using the Expectation Maximization (EM) algorithm to establish a single true-link threshold. We compared the unsupervised probabilistic results against the manually reviewed gold standard for two hospital registries, as well as against our previous deterministic results. At an estimated specificity of 99.95%, actual specificities were 99.43% and 99.42% for registries A and B, respectively. At an estimated sensitivity of 99.95%, actual sensitivities were 99.19% and 98.99% for registries A and B, respectively. The EM algorithm estimated linkage parameters with acceptable accuracy, and was an improvement over the deterministic algorithm. Such a methodology may be used where record linkage is required, but human intervention is not possible or practical.
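
The abstract does not spell out the model behind the estimator, so the following is only a minimal sketch of one common way to set a link threshold without human review, not the authors' implementation. It assumes a Fellegi-Sunter style model with binary field agreement and conditional independence between fields, estimates the per-field agreement probabilities among true links (m) and non-links (u) with EM, and then picks the lowest score cutoff whose estimated false-positive rate stays within 1 minus the target specificity. All function names, starting values, and the simulated data are hypothetical.

```python
import math
import random
from collections import Counter
from itertools import product

EPS = 1e-6  # keeps probabilities away from 0 and 1 so the logs stay finite


def pattern_prob(gamma, probs):
    """P(agreement pattern | class), assuming fields agree independently."""
    return math.prod(q if a else 1.0 - q for a, q in zip(gamma, probs))


def em_estimate(pairs, n_iter=200, p=0.1, m_init=0.9, u_init=0.1):
    """Estimate link proportion p and per-field m/u probabilities, no labels."""
    counts = Counter(pairs)  # EM only needs a count per distinct pattern
    n = sum(counts.values())
    k = len(pairs[0])
    m, u = [m_init] * k, [u_init] * k
    for _ in range(n_iter):
        # E-step: posterior probability that a pattern comes from a true link
        post = {}
        for g in counts:
            pm = p * pattern_prob(g, m)
            pu = (1.0 - p) * pattern_prob(g, u)
            post[g] = pm / (pm + pu)
        # M-step: re-estimate p, m, u from the posterior-weighted counts
        n_link = sum(post[g] * c for g, c in counts.items())
        p = n_link / n
        for j in range(k):
            agree_link = sum(post[g] * c for g, c in counts.items() if g[j])
            agree_non = sum((1.0 - post[g]) * c for g, c in counts.items() if g[j])
            m[j] = min(max(agree_link / n_link, EPS), 1.0 - EPS)
            u[j] = min(max(agree_non / (n - n_link), EPS), 1.0 - EPS)
    return p, m, u


def match_weight(gamma, m, u):
    """Sum of log2 likelihood ratios over fields (agreement or disagreement)."""
    return sum(
        math.log2(m[j] / u[j]) if a else math.log2((1.0 - m[j]) / (1.0 - u[j]))
        for j, a in enumerate(gamma)
    )


def threshold_for_specificity(m, u, target=0.9995):
    """Lowest weight whose estimated false-positive rate is <= 1 - target.

    Pairs are called links when weight >= threshold. Exact ties between
    pattern weights are not treated specially in this sketch.
    """
    scored = sorted(
        ((match_weight(g, m, u), pattern_prob(g, u))
         for g in product((0, 1), repeat=len(m))),
        reverse=True,
    )
    fp, threshold = 0.0, math.inf
    for weight, prob_nonlink in scored:
        if fp + prob_nonlink > 1.0 - target:
            break
        fp += prob_nonlink
        threshold = weight
    return threshold


def simulate_pairs(n, p, m, u, rng):
    """Draw n synthetic candidate pairs with known link status (demo data only)."""
    pairs, labels = [], []
    for _ in range(n):
        is_link = rng.random() < p
        probs = m if is_link else u
        pairs.append(tuple(int(rng.random() < q) for q in probs))
        labels.append(is_link)
    return pairs, labels


if __name__ == "__main__":
    rng = random.Random(0)
    pairs, labels = simulate_pairs(
        20000, 0.05, [0.95, 0.90, 0.92, 0.85, 0.97], [0.05, 0.10, 0.02, 0.20, 0.01], rng
    )
    p_hat, m_hat, u_hat = em_estimate(pairs)
    print("estimated p:", round(p_hat, 4))
    print("threshold:", threshold_for_specificity(m_hat, u_hat))
```

On real registries the estimated and actual specificity can differ, as the abstract reports, because the independence assumption and the estimated u values are only approximations.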
