Suppr超能文献

通过双重抽样对缺失结果数据估计加权分位数治疗效果。

Estimating weighted quantile treatment effects with missing outcome data by double sampling.

作者信息

Sun Shuo, Haneuse Sebastien, Levis Alexander W, Lee Catherine, Arterburn David E, Fischer Heidi, Shortreed Susan, Mukherjee Rajarshi

机构信息

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, United States.

Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, PA 15213, United States.

出版信息

Biometrics. 2025 Apr 2;81(2). doi: 10.1093/biomtc/ujaf038.

Abstract

Causal weighted quantile treatment effects (WQTEs) complement standard mean-focused causal contrasts when interest lies at the tails of the counterfactual distribution. However, existing methods for estimating and inferring causal WQTEs assume complete data on all relevant factors, which is often not the case in practice, particularly when the data are not collected for research purposes, such as electronic health records (EHRs) and disease registries. Furthermore, these data may be particularly susceptible to the outcome data being missing-not-at-random (MNAR). This paper proposes to use double sampling, through which the otherwise missing data are ascertained on a sub-sample of study units, as a strategy to mitigate bias due to MNAR data in estimating causal WQTEs. With the additional data, we present identifying conditions that do not require missingness assumptions in the original data. We then propose a novel inverse-probability weighted estimator and derive its asymptotic properties, both pointwise at specific quantiles and uniformly across quantiles over some compact subset of (0,1), allowing the propensity score and double-sampling probabilities to be estimated. For practical inference, we develop a bootstrap method that can be used for both pointwise and uniform inference. A simulation study is conducted to examine the finite sample performance of the proposed estimators. We illustrate the proposed method using EHR data examining the relative effects of 2 bariatric surgery procedures on BMI loss 3 years post-surgery.

摘要

当关注反事实分布的尾部时,因果加权分位数处理效应(WQTEs)对标准的以均值为重点的因果对比起到补充作用。然而,现有的估计和推断因果WQTEs的方法假定所有相关因素的数据是完整的,但在实际中情况往往并非如此,特别是当数据并非为研究目的而收集时,例如电子健康记录(EHRs)和疾病登记处的数据。此外,这些数据可能特别容易出现非随机缺失(MNAR)的结果数据。本文提出使用双重抽样,通过这种方式在研究单位的子样本上确定原本缺失的数据,作为一种策略来减轻在估计因果WQTEs时由于MNAR数据导致的偏差。利用这些额外的数据,我们提出了识别条件,这些条件在原始数据中不需要缺失性假设。然后,我们提出了一种新颖的逆概率加权估计器,并推导了其渐近性质,包括在特定分位数处的逐点渐近性质以及在(0,1)的某个紧致子集上跨分位数的一致渐近性质,使得倾向得分和双重抽样概率能够被估计。对于实际推断,我们开发了一种可用于逐点推断和一致推断的自助法。进行了一项模拟研究以检验所提出估计器的有限样本性能。我们使用EHR数据说明了所提出的方法,该数据用于研究两种减肥手术程序对术后3年体重指数降低的相对影响。

相似文献

2
Double Sampling for Informatively Missing Data in Electronic Health Record-Based Comparative Effectiveness Research.
Stat Med. 2024 Dec 30;43(30):6086-6098. doi: 10.1002/sim.10298. Epub 2024 Dec 5.
3
Identifiability and estimation of causal mediation effects with missing data.
Stat Med. 2017 Nov 10;36(25):3948-3965. doi: 10.1002/sim.7413. Epub 2017 Aug 7.
4
Double Robust Efficient Estimators of Longitudinal Treatment Effects: Comparative Performance in Simulations and a Case Study.
Int J Biostat. 2019 Feb 26;15(2):/j/ijb.2019.15.issue-2/ijb-2017-0054/ijb-2017-0054.xml. doi: 10.1515/ijb-2017-0054.
6
Multiply robust estimation of causal quantile treatment effects.
Stat Med. 2020 Dec 10;39(28):4238-4251. doi: 10.1002/sim.8722. Epub 2020 Aug 28.
7
Quantile outcome adaptive lasso: Covariate selection for inverse probability weighting estimator of quantile treatment effects.
Stat Methods Med Res. 2025 Jan;34(1):69-84. doi: 10.1177/09622802241299410. Epub 2024 Dec 12.
8
10
Improving causal inference with a doubly robust estimator that combines propensity score stratification and weighting.
J Eval Clin Pract. 2017 Aug;23(4):697-702. doi: 10.1111/jep.12714. Epub 2017 Jan 24.

本文引用的文献

1
Double Sampling for Informatively Missing Data in Electronic Health Record-Based Comparative Effectiveness Research.
Stat Med. 2024 Dec 30;43(30):6086-6098. doi: 10.1002/sim.10298. Epub 2024 Dec 5.
3
Quantile regression for nonignorable missing data with its application of analyzing electronic medical records.
Biometrics. 2023 Sep;79(3):2036-2049. doi: 10.1111/biom.13723. Epub 2022 Aug 4.
5
Adjusting for selection bias due to missing data in electronic health records-based research.
Stat Methods Med Res. 2021 Oct;30(10):2221-2238. doi: 10.1177/09622802211027601. Epub 2021 Aug 26.
6
Investigating Bias from Missing Data in an Electronic Health Records-Based Study of Weight Loss After Bariatric Surgery.
Obes Surg. 2021 May;31(5):2125-2135. doi: 10.1007/s11695-021-05226-y. Epub 2021 Jan 19.
8
Weight Outcomes of Sleeve Gastrectomy and Gastric Bypass Compared to Nonsurgical Treatment.
Ann Surg. 2021 Dec 1;274(6):e1269-e1276. doi: 10.1097/SLA.0000000000003826.
9
A General Framework for Quantile Estimation with Incomplete Data.
J R Stat Soc Series B Stat Methodol. 2019 Apr;81(2):305-333. doi: 10.1111/rssb.12309. Epub 2019 Jan 6.
10
Two-Phase, Generalized Case-Control Designs for the Study of Quantitative Longitudinal Outcomes.
Am J Epidemiol. 2020 Feb 28;189(2):81-90. doi: 10.1093/aje/kwz127.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验