• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用带有数据值先验的多重插补对记录链接数据进行分析。

The analysis of record-linked data using multiple imputation with data value priors.

机构信息

Medical Research Council Centre of Epidemiology for Child health, University College London Institute of Child health, London, WC1N 1EH, UK.

出版信息

Stat Med. 2012 Dec 10;31(28):3481-93. doi: 10.1002/sim.5508. Epub 2012 Jul 17.

DOI:10.1002/sim.5508
PMID:22807145
Abstract

Probabilistic record linkage techniques assign match weights to one or more potential matches for those individual records that cannot be assigned 'unequivocal matches' across data files. Existing methods select the single record having the maximum weight provided that this weight is higher than an assigned threshold. We argue that this procedure, which ignores all information from matches with lower weights and for some individuals assigns no match, is inefficient and may also lead to biases in subsequent analysis of the linked data. We propose that a multiple imputation framework be utilised for data that belong to records that cannot be matched unequivocally. In this way, the information from all potential matches is transferred through to the analysis stage. This procedure allows for the propagation of matching uncertainty through a full modelling process that preserves the data structure. For purposes of statistical modelling, results from a simulation example suggest that a full probabilistic record linkage is unnecessary and that standard multiple imputation will provide unbiased and efficient parameter estimates.

摘要

概率记录链接技术为那些在多个数据文件中无法被明确匹配的个体记录分配匹配权重。现有的方法选择具有最大权重的单个记录,前提是该权重高于指定的阈值。我们认为,这种方法忽略了所有权重较低的匹配信息,并且对于某些个体没有分配任何匹配,是低效的,并且可能会导致在后续对链接数据的分析中产生偏差。我们建议对于那些无法明确匹配的记录所属的数据使用多重插补框架。通过这种方式,所有潜在匹配的信息都可以传递到分析阶段。该方法允许通过完整的建模过程来传播匹配不确定性,同时保留数据结构。出于统计建模的目的,模拟示例的结果表明,完整的概率记录链接是不必要的,标准的多重插补将提供无偏且有效的参数估计。

相似文献

1
The analysis of record-linked data using multiple imputation with data value priors.使用带有数据值先验的多重插补对记录链接数据进行分析。
Stat Med. 2012 Dec 10;31(28):3481-93. doi: 10.1002/sim.5508. Epub 2012 Jul 17.
2
Accounting for bias due to outcome data missing not at random: comparison and illustration of two approaches to probabilistic bias analysis: a simulation study.考虑由于非随机缺失结局数据导致的偏倚:两种概率性偏倚分析方法的比较和说明:一项模拟研究。
BMC Med Res Methodol. 2024 Nov 13;24(1):278. doi: 10.1186/s12874-024-02382-4.
3
Evaluating bias due to data linkage error in electronic healthcare records.评估电子医疗记录中因数据链接错误导致的偏差。
BMC Med Res Methodol. 2014 Mar 5;14:36. doi: 10.1186/1471-2288-14-36.
4
[Markov Chain Monte Carlo Method of multiple imputation for longitudinal data with missing values in the survey of maternal and children health].[妇幼健康调查中具有缺失值的纵向数据多重填补的马尔可夫链蒙特卡罗方法]
Sichuan Da Xue Xue Bao Yi Xue Ban. 2005 May;36(3):422-5.
5
Evaluating the use of existing data sources, probabilistic linkage, and multiple imputation to build population-based injury databases across phases of trauma care.评估利用现有数据源、概率性链接和多重插补在创伤救治各阶段构建基于人群的伤害数据库。
Acad Emerg Med. 2012 Apr;19(4):469-80. doi: 10.1111/j.1553-2712.2012.01324.x.
6
Categorical linkage-data analysis.分类关联数据分析。
Stat Med. 2024 Aug 15;43(18):3463-3483. doi: 10.1002/sim.10134. Epub 2024 Jun 10.
7
Multiple imputation for handling missing outcome data when estimating the relative risk.采用多重插补处理估计相对危险度时丢失的结局数据。
BMC Med Res Methodol. 2017 Sep 6;17(1):134. doi: 10.1186/s12874-017-0414-5.
8
Unit information Dirichlet process prior.单位信息狄利克雷过程先验。
Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae091.
9
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
10
Comprehensive implementations of multiple imputation using retrieved dropouts for continuous endpoints.使用检索到的失访数据对连续终点进行多重填补的综合实施方法。
BMC Med Res Methodol. 2025 Feb 21;25(1):47. doi: 10.1186/s12874-025-02494-5.

引用本文的文献

1
Survival analysis under imperfect record linkage using historic census data.基于历史人口普查数据的不完全记录链接下的生存分析。
BMC Med Res Methodol. 2024 Mar 13;24(1):67. doi: 10.1186/s12874-024-02194-6.
2
Optimizing the Retrieval of the Vital Status of Cancer Patients for Health Data Warehouses by Using Open Government Data in France.利用法国开放政府数据优化健康数据仓库中癌症患者生命状态的检索
Int J Environ Res Public Health. 2022 Apr 2;19(7):4272. doi: 10.3390/ijerph19074272.
3
Linking education and hospital data in England: linkage process and quality.
链接英格兰的教育和医院数据:链接过程和质量。
Int J Popul Data Sci. 2021 Sep 16;6(1):1671. doi: 10.23889/ijpds.v6i1.1671. eCollection 2021.
4
Probabilistic linkage without personal information successfully linked national clinical datasets.无需个人信息的概率链接成功链接了国家临床数据集。
J Clin Epidemiol. 2021 Aug;136:136-145. doi: 10.1016/j.jclinepi.2021.04.015. Epub 2021 Apr 28.
5
Assessing data linkage quality in cohort studies.评估队列研究中的数据链接质量。
Ann Hum Biol. 2020 Mar;47(2):218-226. doi: 10.1080/03014460.2020.1742379.
6
Reflections on modern methods: linkage error bias.关于现代方法的思考:连锁错误偏差。
Int J Epidemiol. 2019 Dec 1;48(6):2050-2060. doi: 10.1093/ije/dyz203.
7
Demystifying probabilistic linkage: Common myths and misconceptions.揭开概率关联的神秘面纱:常见的误解与错误观念。
Int J Popul Data Sci. 2018 Jan 10;3(1):410. doi: 10.23889/ijpds.v3i1.410.
8
Impact of linkage quality on inferences drawn from analyses using data with high rates of linkage errors in rural Tanzania.坦桑尼亚农村地区高连锁错误率数据的分析中,连锁质量对推论的影响。
BMC Med Res Methodol. 2018 Dec 10;18(1):165. doi: 10.1186/s12874-018-0632-5.
9
Challenges in administrative data linkage for research.研究中行政数据链接的挑战。
Big Data Soc. 2017 Dec 5;4(2):2053951717745678. doi: 10.1177/2053951717745678.
10
Historical Census Record Linkage.历史人口普查记录关联
Annu Rev Sociol. 2018 Jul;44:19-37. doi: 10.1146/annurev-soc-073117-041447. Epub 2018 May 18.