• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯因果推断在协变量和结局缺失的观察性研究中的应用。

Bayesian causal inference for observational studies with missingness in covariates and outcomes.

机构信息

Heart Institute, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio, USA.

Division of Statistics and Data Science, University of Cincinnati, Cincinnati, Ohio, USA.

出版信息

Biometrics. 2023 Dec;79(4):3624-3636. doi: 10.1111/biom.13918. Epub 2023 Aug 8.

DOI:10.1111/biom.13918
PMID:37553770
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10840608/
Abstract

Missing data are a pervasive issue in observational studies using electronic health records or patient registries. It presents unique challenges for statistical inference, especially causal inference. Inappropriately handling missing data in causal inference could potentially bias causal estimation. Besides missing data problems, observational health data structures typically have mixed-type variables - continuous and categorical covariates - whose joint distribution is often too complex to be modeled by simple parametric models. The existence of missing values in covariates and outcomes makes the causal inference even more challenging, while most standard causal inference approaches assume fully observed data or start their works after imputing missing values in a separate preprocessing stage. To address these problems, we introduce a Bayesian nonparametric causal model to estimate causal effects with missing data. The proposed approach can simultaneously impute missing values, account for multiple outcomes, and estimate causal effects under the potential outcomes framework. We provide three simulation studies to show the performance of our proposed method under complicated data settings whose features are similar to our case studies. For example, Simulation Study 3 assumes the case where missing values exist in both outcomes and covariates. Two case studies were conducted applying our method to evaluate the comparative effectiveness of treatments for chronic disease management in juvenile idiopathic arthritis and cystic fibrosis.

摘要

在使用电子健康记录或患者登记处的观察性研究中,缺失数据是一个普遍存在的问题。它对统计推断,特别是因果推断提出了独特的挑战。在因果推断中不恰当地处理缺失数据可能会潜在地偏置因果估计。除了缺失数据问题外,观察性健康数据结构通常具有混合类型变量 - 连续和分类协变量 - 其联合分布通常太复杂,无法通过简单的参数模型进行建模。协变量和结果中的缺失值的存在使得因果推断更加具有挑战性,而大多数标准的因果推断方法假设完全观察到的数据,或者在单独的预处理阶段对缺失值进行插补后开始工作。为了解决这些问题,我们引入了一种贝叶斯非参数因果模型来估计具有缺失数据的因果效应。所提出的方法可以同时插补缺失值,考虑多个结果,并在潜在结果框架下估计因果效应。我们提供了三项模拟研究,以在与我们的案例研究相似的复杂数据设置下展示我们提出的方法的性能。例如,模拟研究 3 假设在结果和协变量中都存在缺失值的情况。进行了两项案例研究,应用我们的方法来评估青少年特发性关节炎和囊性纤维化中慢性疾病管理治疗的比较效果。

相似文献

1
Bayesian causal inference for observational studies with missingness in covariates and outcomes.贝叶斯因果推断在协变量和结局缺失的观察性研究中的应用。
Biometrics. 2023 Dec;79(4):3624-3636. doi: 10.1111/biom.13918. Epub 2023 Aug 8.
2
Bayesian nonparametric generative models for causal inference with missing at random covariates.用于在协变量随机缺失情况下进行因果推断的贝叶斯非参数生成模型。
Biometrics. 2018 Dec;74(4):1193-1202. doi: 10.1111/biom.12875. Epub 2018 Mar 26.
3
Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure.比较处理二分类暴露因素倾向性评分估计中协变量缺失的方法。
BMC Med Res Methodol. 2020 Jun 26;20(1):168. doi: 10.1186/s12874-020-01053-4.
4
Sequential BART for imputation of missing covariates.用于插补缺失协变量的顺序BART
Biostatistics. 2016 Jul;17(3):589-602. doi: 10.1093/biostatistics/kxw009. Epub 2016 Mar 15.
5
Handling missing data when estimating causal effects with targeted maximum likelihood estimation. 采用有向极大似然估计法估计因果效应时处理缺失数据。
Am J Epidemiol. 2024 Jul 8;193(7):1019-1030. doi: 10.1093/aje/kwae012.
6
A Bayesian nonparametric approach to causal inference on quantiles.一种用于分位数因果推断的贝叶斯非参数方法。
Biometrics. 2018 Sep;74(3):986-996. doi: 10.1111/biom.12863. Epub 2018 Feb 25.
7
Fully Bayesian inference under ignorable missingness in the presence of auxiliary covariates.存在辅助协变量时可忽略缺失情况下的全贝叶斯推断。
Biometrics. 2014 Mar;70(1):62-72. doi: 10.1111/biom.12121. Epub 2013 Dec 10.
8
Imputation approaches for potential outcomes in causal inference.因果推断中潜在结果的插补方法。
Int J Epidemiol. 2015 Oct;44(5):1731-7. doi: 10.1093/ije/dyv135. Epub 2015 Jul 25.
9
Comparing methods for estimation of heterogeneous treatment effects using observational data from health care databases.利用医疗保健数据库中的观察数据比较估计异质治疗效果的方法。
Stat Med. 2018 Oct 15;37(23):3309-3324. doi: 10.1002/sim.7820. Epub 2018 Jun 3.
10
Identifiability and estimation of causal mediation effects with missing data.存在缺失数据时因果中介效应的可识别性与估计
Stat Med. 2017 Nov 10;36(25):3948-3965. doi: 10.1002/sim.7413. Epub 2017 Aug 7.

引用本文的文献

1
Considerations for Causal Inference Studies.因果推断研究的注意事项。
Respirology. 2025 May;30(5):382-384. doi: 10.1111/resp.70018. Epub 2025 Mar 16.
2
Conceptual framework as a guide to choose appropriate imputation method for missing values in a clinical structured dataset.概念框架作为选择临床结构化数据集中缺失值的适当插补方法的指南。
BMC Med Res Methodol. 2025 Feb 20;25(1):43. doi: 10.1186/s12874-025-02496-3.
3
Adaptive Universal Principles for Real-world Observational Studies (AUPROS): an approach to designing real-world observational studies for clinical, epidemiologic, and precision oncology research.

本文引用的文献

1
Bayesian semi-parametric G-computation for causal inference in a cohort study with MNAR dropout and death.用于具有非随机缺失和死亡的队列研究中因果推断的贝叶斯半参数G计算法
J R Stat Soc Ser C Appl Stat. 2021 Mar;70(2):398-414. doi: 10.1111/rssc.12464. Epub 2021 Jan 6.
2
Informative missingness in electronic health record systems: the curse of knowing.电子健康记录系统中的信息性缺失:知晓之祸。
Diagn Progn Res. 2020 Jul 2;4:8. doi: 10.1186/s41512-020-00077-0. eCollection 2020.
3
Timing matters: real-world effectiveness of early combination of biologic and conventional synthetic disease-modifying antirheumatic drugs for treating newly diagnosed polyarticular course juvenile idiopathic arthritis.
真实世界观察性研究的适应性通用原则(AUPROS):一种为临床、流行病学和精准肿瘤学研究设计真实世界观察性研究的方法。
Br J Cancer. 2025 Feb;132(2):139-153. doi: 10.1038/s41416-024-02899-x. Epub 2024 Nov 21.
4
Identify the most appropriate imputation method for handling missing values in clinical structured datasets: a systematic review.识别处理临床结构化数据集缺失值的最合适插补方法:系统评价。
BMC Med Res Methodol. 2024 Aug 28;24(1):188. doi: 10.1186/s12874-024-02310-6.
5
Design, implementation, and inferential issues associated with clinical trials that rely on data in electronic medical records: a narrative review.依赖电子病历数据的临床试验的设计、实施和推论问题:叙述性综述。
BMC Med Res Methodol. 2023 Nov 16;23(1):271. doi: 10.1186/s12874-023-02102-4.
时机很重要:生物制剂和传统合成疾病修饰抗风湿药物早期联合治疗新诊断多关节病程幼年特发性关节炎的真实世界疗效。
RMD Open. 2020 Jan;6(1). doi: 10.1136/rmdopen-2019-001091.
4
Bayesian nonparametric generative models for causal inference with missing at random covariates.用于在协变量随机缺失情况下进行因果推断的贝叶斯非参数生成模型。
Biometrics. 2018 Dec;74(4):1193-1202. doi: 10.1111/biom.12875. Epub 2018 Mar 26.
5
Propensity score analysis with partially observed covariates: How should multiple imputation be used?倾向评分分析与部分观测协变量:应如何使用多重插补?
Stat Methods Med Res. 2019 Jan;28(1):3-19. doi: 10.1177/0962280217713032. Epub 2017 Jun 2.
6
Use of FEV in cystic fibrosis epidemiologic studies and clinical trials: A statistical perspective for the clinical researcher.用力呼气容积(FEV)在囊性纤维化流行病学研究和临床试验中的应用:临床研究者的统计学视角
J Cyst Fibros. 2017 May;16(3):318-326. doi: 10.1016/j.jcf.2017.01.002. Epub 2017 Jan 20.
7
The Cystic Fibrosis Foundation Patient Registry. Design and Methods of a National Observational Disease Registry.囊性纤维化基金会患者登记处。一个国家观察性疾病登记处的设计与方法。
Ann Am Thorac Soc. 2016 Jul;13(7):1173-9. doi: 10.1513/AnnalsATS.201511-781OC.
8
Incidence and prevalence of juvenile idiopathic arthritis among children in a managed care population, 1996-2009.1996-2009 年,在管理式医疗人群中儿童幼年特发性关节炎的发病率和患病率。
J Rheumatol. 2013 Jul;40(7):1218-25. doi: 10.3899/jrheum.120661. Epub 2013 Apr 15.
9
Importance of health information technology, electronic health records, and continuously aggregating data to comparative effectiveness research and learning health care.重视健康信息技术、电子健康记录以及不断积累数据,以进行比较效果研究和学习医疗保健。
J Clin Oncol. 2012 Dec 1;30(34):4243-8. doi: 10.1200/JCO.2012.42.8011. Epub 2012 Oct 15.
10
Targeted maximum likelihood based causal inference: Part I.基于靶向最大似然法的因果推断:第一部分。
Int J Biostat. 2010;6(2):Article 2. doi: 10.2202/1557-4679.1211.