• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于具有分布在两个文件中的协变量的观察性研究中因果推断的回归辅助贝叶斯记录链接

Regression-Assisted Bayesian Record Linkage for Causal Inference in Observational Studies with Covariates Spread Over Two Files.

作者信息

Guha Sharmistha, Reiter Jerome P

机构信息

Department of Statistics, Texas A&M University, College Station, 77843, TX, USA.

Department of Statistical Science, Duke University, Durham, 27708, NC, USA.

出版信息

J Stat Plan Inference. 2024 Mar;229. doi: 10.1016/j.jspi.2023.07.004. Epub 2023 Aug 1.

DOI:10.1016/j.jspi.2023.07.004
PMID:39076728
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11283754/
Abstract

We consider causal inference for observational studies with data spread over two files. One file includes the treatment, outcome, and some covariates measured on a set of individuals, and the other file includes additional causally-relevant covariates measured on a partially overlapping set of individuals. By linking records in the two databases, the analyst can control for more covariates, thereby reducing the risk of bias compared to using only one file alone. When analysts do not have access to a unique identifier that enables perfect, error-free linkages, they typically rely on probabilistic record linkage to construct a single linked data set, and estimate causal effects using these linked data. This typical practice does not propagate uncertainty from imperfect linkages to the causal inferences. Further, it does not take advantage of relationships among the variables to improve the linkage quality. We address these shortcomings by fusing regression-assisted, Bayesian probabilistic record linkage with causal inference. The Markov chain Monte Carlo sampler generates multiple plausible linked data files as byproducts that analysts can use for multiple imputation inferences. Here, we show results for two causal estimators based on propensity score overlap weights. Using simulations and data from the Italy Survey on Household Income and Wealth, we show that our approach can improve the accuracy of estimated treatment effects.

摘要

我们考虑对数据分布在两个文件中的观察性研究进行因果推断。一个文件包含对一组个体测量的治疗、结局和一些协变量,另一个文件包含对部分重叠个体集测量的其他与因果相关的协变量。通过链接两个数据库中的记录,与仅使用一个文件相比,分析师可以控制更多协变量,从而降低偏差风险。当分析师无法获得能够实现完美、无错误链接的唯一标识符时,他们通常依靠概率性记录链接来构建单个链接数据集,并使用这些链接数据估计因果效应。这种典型做法不会将不完美链接中的不确定性传播到因果推断中。此外,它没有利用变量之间的关系来提高链接质量。我们通过将回归辅助的贝叶斯概率性记录链接与因果推断相结合来解决这些缺点。马尔可夫链蒙特卡罗采样器生成多个合理的链接数据文件作为副产品,分析师可以将其用于多重插补推断。在此,我们展示了基于倾向得分重叠权重的两个因果估计量的结果。使用来自意大利家庭收入和财富调查的模拟数据和实际数据,我们表明我们的方法可以提高估计治疗效果的准确性。

相似文献

1
Regression-Assisted Bayesian Record Linkage for Causal Inference in Observational Studies with Covariates Spread Over Two Files.用于具有分布在两个文件中的协变量的观察性研究中因果推断的回归辅助贝叶斯记录链接
J Stat Plan Inference. 2024 Mar;229. doi: 10.1016/j.jspi.2023.07.004. Epub 2023 Aug 1.
2
A MULTIPLE IMPUTATION PROCEDURE FOR RECORD LINKAGE AND CAUSAL INFERENCE TO ESTIMATE THE EFFECTS OF HOME-DELIVERED MEALS.一种用于记录链接和因果推断以估计上门送餐效果的多重填补程序。
Ann Appl Stat. 2021 Mar;15(1):412-436. doi: 10.1214/20-aoas1397. Epub 2021 Mar 18.
3
Simultaneous record linkage and causal inference with propensity score subclassification.同时进行倾向评分亚组分类的记录链接和因果推断。
Stat Med. 2018 Oct 30;37(24):3533-3546. doi: 10.1002/sim.7911. Epub 2018 Aug 1.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Bayesian record linkage with variables in one file.含单个文件中变量的贝叶斯记录链接
Stat Med. 2023 Nov 30;42(27):4931-4951. doi: 10.1002/sim.9894. Epub 2023 Aug 31.
6
Bayesian causal inference for observational studies with missingness in covariates and outcomes.贝叶斯因果推断在协变量和结局缺失的观察性研究中的应用。
Biometrics. 2023 Dec;79(4):3624-3636. doi: 10.1111/biom.13918. Epub 2023 Aug 8.
7
Issues in identification and linkage of patient records across an integrated delivery system.综合医疗服务体系中患者记录的识别与关联问题。
J Healthc Inf Manag. 1998 Fall;12(3):43-52.
8
Impact of linkage quality on inferences drawn from analyses using data with high rates of linkage errors in rural Tanzania.坦桑尼亚农村地区高连锁错误率数据的分析中,连锁质量对推论的影响。
BMC Med Res Methodol. 2018 Dec 10;18(1):165. doi: 10.1186/s12874-018-0632-5.
9
Comparison of the ability of double-robust estimators to correct bias in propensity score matching analysis. A Monte Carlo simulation study.双重稳健估计在倾向评分匹配分析中校正偏差的能力比较。一项蒙特卡罗模拟研究。
Pharmacoepidemiol Drug Saf. 2017 Dec;26(12):1513-1519. doi: 10.1002/pds.4325. Epub 2017 Oct 6.
10
Improving causal inference with a doubly robust estimator that combines propensity score stratification and weighting.利用结合倾向评分分层和加权的双重稳健估计器改进因果推断。
J Eval Clin Pract. 2017 Aug;23(4):697-702. doi: 10.1111/jep.12714. Epub 2017 Jan 24.

引用本文的文献

1
Studying Chinese immigrants' spatial distribution in the Raleigh-Durham area by linking survey and commercial data using romanized names.通过使用罗马化姓名将调查数据与商业数据相链接,研究罗利-达勒姆地区华裔移民的空间分布情况。
J R Stat Soc Ser A Stat Soc. 2024 Oct 23;188(1):84-97. doi: 10.1093/jrsssa/qnae107. eCollection 2025 Jan.

本文引用的文献

1
Addressing Extreme Propensity Scores via the Overlap Weights.通过重叠权重解决极端倾向评分。
Am J Epidemiol. 2019 Jan 1;188(1):250-257. doi: 10.1093/aje/kwy201.
2
Simultaneous record linkage and causal inference with propensity score subclassification.同时进行倾向评分亚组分类的记录链接和因果推断。
Stat Med. 2018 Oct 30;37(24):3533-3546. doi: 10.1002/sim.7911. Epub 2018 Aug 1.
3
A Bayesian Procedure for File Linking to Analyze End-of-Life Medical Costs.一种用于文件链接以分析临终医疗费用的贝叶斯程序。
J Am Stat Assoc. 2013 Jan 1;108(501):34-47. doi: 10.1080/01621459.2012.726889.
4
Relationship between early physician follow-up and 30-day readmission among Medicare beneficiaries hospitalized for heart failure.医疗保险受益人因心力衰竭住院后,与 30 天再入院相关的早期医生随访情况。
JAMA. 2010 May 5;303(17):1716-22. doi: 10.1001/jama.2010.533.