• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

确定性记录链接与相似性函数:巴西健康数据库研究

Deterministic record linkage versus similarity functions: a study in health databases from Brazil.

作者信息

Suzuki Kátia Mitiko Firmino, Porto Filho Carlos Humberto, Cozin Luís Fernando, Pereyra Lucas Calabrez, de Azevedo Marques Paulo Mazzoncini

机构信息

School of Medicine of Ribeirao Preto (FMRP), University of Sao Paulo (USP), Brazil.

出版信息

Stud Health Technol Inform. 2013;192:562-6.

PMID:23920618
Abstract

The record linkage is a strategy that allows linking different databases of information from patient records. Adopting the deterministic method and similarity functions (Dice, Jaro, Jaro-Winkler and Levenshtein) for the integration of heterogeneous databases aimed at different levels of health care Brazilian (primary, secondary and tertiary). The sensitivity of deterministic method was 54.5% (95% CI: 50.4 to 58.5). The best result obtained with the dissent of only one variable (mother's name) was 80.6% (95% CI: 77.2 to 83.6) and the best result obtained using the similarity function Jaro-Winkler was 91.8% (95% CI: 89.4 to 93.9). The deterministic method has high specificity but sensitivity can be reduced by the existence of spellings and typing errors in the databases. Thus, the step-by-step approach where there was disagreement in at least one of the relationship variable can increase the sensitivity of the method and the use of similarity functions.

摘要

记录链接是一种允许将来自患者记录的不同信息数据库进行链接的策略。采用确定性方法和相似性函数(戴斯系数、贾罗相似度、贾罗-温克勒相似度和莱文斯坦距离)来整合针对巴西不同医疗保健层面(初级、中级和高级)的异构数据库。确定性方法的敏感度为54.5%(95%置信区间:50.4至58.5)。仅排除一个变量(母亲姓名)时获得的最佳结果为80.6%(95%置信区间:77.2至83.6),使用贾罗-温克勒相似性函数获得的最佳结果为91.8%(95%置信区间:89.4至93.9)。确定性方法具有较高的特异性,但由于数据库中存在拼写和录入错误,敏感度可能会降低。因此,在至少一个关系变量存在不一致的情况下采用逐步方法,可以提高该方法的敏感度以及相似性函数的使用效果。

相似文献

1
Deterministic record linkage versus similarity functions: a study in health databases from Brazil.确定性记录链接与相似性函数:巴西健康数据库研究
Stud Health Technol Inform. 2013;192:562-6.
2
Real world performance of approximate string comparators for use in patient matching.用于患者匹配的近似字符串比较器的实际性能。
Stud Health Technol Inform. 2004;107(Pt 1):43-7.
3
The development of a data-matching algorithm to define the 'case patient'.用于定义“病例患者”的数据匹配算法的开发。
Aust Health Rev. 2013 Feb;37(1):54-9. doi: 10.1071/AH11161.
4
Integrating population- and patient-level data for secondary use of electronic health records to study overweight and obesity.整合人群和患者层面的数据以二次利用电子健康记录来研究超重和肥胖。
Stud Health Technol Inform. 2013;192:1100.
5
Where No Universal Health Care Identifier Exists: Comparison and Determination of the Utility of Score-Based Persons Matching Algorithms Using Demographic Data.在不存在通用医疗保健标识符的情况下:使用人口统计学数据对基于分数的人员匹配算法的效用进行比较和判定。
JMIR Public Health Surveill. 2018 Dec 13;4(4):e10436. doi: 10.2196/10436.
6
Construction of the integrated multicentre discharge summary database.综合多中心出院小结数据库的构建。
Stud Health Technol Inform. 2013;192:1064.
7
A comparison of accuracy and computational feasibility of two record linkage algorithms in retrieving vital status information from HIV/AIDS patients registered in Brazilian public databases.两种记录链接算法在从巴西公共数据库中检索艾滋病毒/艾滋病患者生命状态信息方面的准确性和计算可行性比较。
Int J Med Inform. 2018 Jun;114:45-51. doi: 10.1016/j.ijmedinf.2018.03.005. Epub 2018 Mar 20.
8
Clinical application of the integrated multicenter discharge summary database.综合多中心出院小结数据库的临床应用
Stud Health Technol Inform. 2015;216:1120.
9
Linking mothers and infants within electronic health records: a comparison of deterministic and probabilistic algorithms.在电子健康记录中关联母婴:确定性算法与概率性算法的比较
Pharmacoepidemiol Drug Saf. 2015 Jan;24(1):45-51. doi: 10.1002/pds.3728. Epub 2014 Nov 18.
10
Comparison of clinical knowledge bases for summarization of electronic health records.用于电子健康记录摘要的临床知识库比较
Stud Health Technol Inform. 2013;192:1217.

引用本文的文献

1
Missing Cases of Bacteriologically Confirmed TB/DR-TB from the National Treatment Registers in West and North Sumatra Provinces, Indonesia.印度尼西亚西苏门答腊省和北苏门答腊省国家治疗登记册中细菌学确诊肺结核/耐多药肺结核的漏报病例
Trop Med Infect Dis. 2023 Jan 2;8(1):31. doi: 10.3390/tropicalmed8010031.