• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用图论方法识别记录链接中的错误。

Use of graph theory measures to identify errors in record linkage.

作者信息

Randall Sean M, Boyd James H, Ferrante Anna M, Bauer Jacqueline K, Semmens James B

机构信息

Centre for Data Linkage, Curtin University, Kent Street, Bentley, WA 6102, Australia.

出版信息

Comput Methods Programs Biomed. 2014 Jul;115(2):55-63. doi: 10.1016/j.cmpb.2014.03.008. Epub 2014 Apr 3.

DOI:10.1016/j.cmpb.2014.03.008
PMID:24768079
Abstract

Ensuring high linkage quality is important in many record linkage applications. Current methods for ensuring quality are manual and resource intensive. This paper seeks to determine the effectiveness of graph theory techniques in identifying record linkage errors. A range of graph theory techniques was applied to two linked datasets, with known truth sets. The ability of graph theory techniques to identify groups containing errors was compared to a widely used threshold setting technique. This methodology shows promise; however, further investigations into graph theory techniques are required. The development of more efficient and effective methods of improving linkage quality will result in higher quality datasets that can be delivered to researchers in shorter timeframes.

摘要

在许多记录链接应用中,确保高链接质量很重要。当前用于确保质量的方法是人工的且资源密集。本文旨在确定图论技术在识别记录链接错误方面的有效性。一系列图论技术被应用于两个带有已知真值集的链接数据集。将图论技术识别包含错误组的能力与一种广泛使用的阈值设置技术进行了比较。这种方法显示出了前景;然而,需要对图论技术进行进一步研究。开发更高效有效的提高链接质量的方法将产生更高质量的数据集,这些数据集能够在更短的时间内交付给研究人员。

相似文献

1
Use of graph theory measures to identify errors in record linkage.使用图论方法识别记录链接中的错误。
Comput Methods Programs Biomed. 2014 Jul;115(2):55-63. doi: 10.1016/j.cmpb.2014.03.008. Epub 2014 Apr 3.
2
Controlling false match rates in record linkage using extreme value theory.利用极值理论控制记录匹配中的错误匹配率。
J Biomed Inform. 2011 Aug;44(4):648-54. doi: 10.1016/j.jbi.2011.02.008. Epub 2011 Feb 23.
3
The dGrail toolkit for iterative deterministic record linkage.
AMIA Annu Symp Proc. 2007 Oct 11:951.
4
Results from simulated data sets: probabilistic record linkage outperforms deterministic record linkage.模拟数据集的结果:概率记录链接优于确定性记录链接。
J Clin Epidemiol. 2011 May;64(5):565-72. doi: 10.1016/j.jclinepi.2010.05.008. Epub 2010 Oct 16.
5
Record linkage is feasible with non-identifiable trauma and rehabilitation datasets.记录链接对于不可识别的创伤和康复数据集是可行的。
Aust N Z J Public Health. 2016 Jun;40(3):245-9. doi: 10.1111/1753-6405.12510. Epub 2016 Mar 30.
6
Impact of linkage quality on inferences drawn from analyses using data with high rates of linkage errors in rural Tanzania.坦桑尼亚农村地区高连锁错误率数据的分析中,连锁质量对推论的影响。
BMC Med Res Methodol. 2018 Dec 10;18(1):165. doi: 10.1186/s12874-018-0632-5.
7
Comparing record linkage software programs and algorithms using real-world data.使用真实世界的数据比较记录链接软件程序和算法。
PLoS One. 2019 Sep 24;14(9):e0221459. doi: 10.1371/journal.pone.0221459. eCollection 2019.
8
A new computationally efficient algorithm for record linkage with field dependency and missing data imputation.一种新的具有字段依赖性和缺失数据插补功能的计算效率高的记录链接算法。
Int J Med Inform. 2018 Jan;109:70-75. doi: 10.1016/j.ijmedinf.2017.10.021. Epub 2017 Nov 6.
9
Sociodemographic differences in linkage error: an examination of four large-scale datasets.连锁错误中的社会人口学差异:对四个大规模数据集的考察
BMC Health Serv Res. 2018 Sep 3;18(1):678. doi: 10.1186/s12913-018-3495-x.
10
An empirical comparison of record linkage procedures.记录链接程序的实证比较。
Stat Med. 2002 May 30;21(10):1485-96. doi: 10.1002/sim.1147.

引用本文的文献

1
Sociodemographic differences in linkage error: an examination of four large-scale datasets.连锁错误中的社会人口学差异:对四个大规模数据集的考察
BMC Health Serv Res. 2018 Sep 3;18(1):678. doi: 10.1186/s12913-018-3495-x.
2
Estimating parameters for probabilistic linkage of privacy-preserved datasets.估算隐私保护数据集概率关联的参数。
BMC Med Res Methodol. 2017 Jul 10;17(1):95. doi: 10.1186/s12874-017-0370-0.
3
Ensuring Privacy When Integrating Patient-Based Datasets: New Methods and Developments in Record Linkage.整合基于患者的数据集时确保隐私:记录链接的新方法与进展
Front Public Health. 2017 Mar 2;5:34. doi: 10.3389/fpubh.2017.00034. eCollection 2017.