Faculty of Pharmacy, Université de Montréal, Montréal, Québec, Canada.
Statistics Canada, Ottawa, Ontario, Canada.
Biometrics. 2020 Mar;76(1):145-157. doi: 10.1111/biom.13135. Epub 2019 Nov 6.
Causal inference methods have been developed for longitudinal observational study designs where confounding is thought to occur over time. In particular, one may estimate and contrast the population mean counterfactual outcome under specific exposure patterns. In such contexts, confounders of the longitudinal treatment-outcome association are generally identified using domain-specific knowledge. However, this may leave an analyst with a large set of potential confounders that may hinder estimation. Previous approaches to data-adaptive model selection for this type of causal parameter were limited to the single time-point setting. We develop a longitudinal extension of a collaborative targeted minimum loss-based estimation (C-TMLE) algorithm that can be applied to perform variable selection in the models for the probability of treatment with the goal of improving the estimation of the population mean counterfactual outcome under a fixed exposure pattern. We investigate the properties of this method through a simulation study, comparing it to G-Computation and inverse probability of treatment weighting. We then apply the method in a real-data example to evaluate the safety of trimester-specific exposure to inhaled corticosteroids during pregnancy in women with mild asthma. The data for this study were obtained from the linkage of electronic health databases in the province of Quebec, Canada. The C-TMLE covariate selection approach allowed for a reduction of the set of potential confounders, which included baseline and longitudinal variables.
因果推断方法已经被开发出来,用于纵向观察性研究设计,其中认为混杂因素会随着时间的推移而发生。特别是,人们可以估计和对比特定暴露模式下的总体平均反事实结果。在这种情况下,纵向治疗结果关联的混杂因素通常是使用特定领域的知识来识别的。然而,这可能会给分析人员留下一大组可能阻碍估计的潜在混杂因素。以前针对这种因果参数的数据自适应模型选择方法仅限于单点设置。我们开发了一种基于协同目标最小损失估计(C-TMLE)算法的纵向扩展,该算法可用于对治疗概率模型进行变量选择,目的是改进固定暴露模式下总体平均反事实结果的估计。我们通过模拟研究来研究该方法的性质,将其与 G-计算和治疗倾向逆概率加权进行比较。然后,我们将该方法应用于一个真实数据示例,以评估轻度哮喘孕妇在特定妊娠期间吸入皮质类固醇的安全性。该研究的数据来自加拿大魁北克省电子健康数据库的链接。C-TMLE 协变量选择方法允许减少潜在混杂因素的数量,包括基线和纵向变量。