Suppr超能文献

一种用于记录链接的缩放方法。

A scaling approach to record linkage.

作者信息

Goldstein Harvey, Harron Katie, Cortina-Borja Mario

机构信息

University of Bristol, Bristol, U.K.

University College London, London, U.K.

出版信息

Stat Med. 2017 Jul 20;36(16):2514-2521. doi: 10.1002/sim.7287. Epub 2017 Mar 16.

Abstract

With increasing availability of large datasets derived from administrative and other sources, there is an increasing demand for the successful linking of these to provide rich sources of data for further analysis. Variation in the quality of identifiers used to carry out linkage means that existing approaches are often based upon 'probabilistic' models, which are based on a number of assumptions, and can make heavy computational demands. In this paper, we suggest a new approach to classifying record pairs in linkage, based upon weights (scores) derived using a scaling algorithm. The proposed method does not rely on training data, is computationally fast, requires only moderate amounts of storage and has intuitive appeal. Copyright © 2017 John Wiley & Sons, Ltd.

摘要

随着从行政及其他来源获得的大型数据集越来越多,人们对成功链接这些数据集以提供丰富数据来源用于进一步分析的需求也日益增加。用于进行链接的标识符质量存在差异,这意味着现有方法通常基于“概率”模型,这些模型基于一些假设,并且可能需要大量计算。在本文中,我们提出了一种基于使用缩放算法得出的权重(分数)对链接中的记录对进行分类的新方法。所提出的方法不依赖训练数据,计算速度快,只需要适度的存储量,并且具有直观的吸引力。版权所有© 2017约翰·威利父子有限公司。

相似文献

1
A scaling approach to record linkage.一种用于记录链接的缩放方法。
Stat Med. 2017 Jul 20;36(16):2514-2521. doi: 10.1002/sim.7287. Epub 2017 Mar 16.
6
Probabilistic record linkage.概率性记录链接
Int J Epidemiol. 2016 Jun;45(3):954-64. doi: 10.1093/ije/dyv322. Epub 2015 Dec 20.

引用本文的文献

1
Synthetic data in health care: A narrative review.医疗保健中的合成数据:一篇叙述性综述。
PLOS Digit Health. 2023 Jan 6;2(1):e0000082. doi: 10.1371/journal.pdig.0000082. eCollection 2023 Jan.
4
Assessing data linkage quality in cohort studies.评估队列研究中的数据链接质量。
Ann Hum Biol. 2020 Mar;47(2):218-226. doi: 10.1080/03014460.2020.1742379.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验