Gao Qian, Wang Jiale, Fang Ruiling, Sun Hongwei, Wang Tong
Department of Health Statistics, School of Public Health, MOE Key Laboratory of Coal Environmental Pathogenicity and Prevention, Shanxi Medical University, No.56 Xinjian South Road, Taiyuan, 030001, China.
Department of Health Statistics, School of Public Health, Binzhou Medical University, Yantai, China.
BMC Med Res Methodol. 2025 Feb 13;25(1):35. doi: 10.1186/s12874-025-02488-3.
Generalized propensity score (GPS) methods have become popular for estimating causal relationships between a continuous treatment and an outcome in observational studies with rich covariate information. The presence of rich covariates enhances the plausibility of the unconfoundedness assumption. Nonetheless, it is also crucial to ensure the correct specification of both marginal and conditional treatment distributions, beyond the assumption of unconfoundedness.
We address limitations in existing GPS methods by extending balance-based approaches to high dimensions and introducing the Generalized Outcome-Adaptive LASSO and Doubly Robust Estimate (GOALDeR). This novel approach integrates a balance-based method that is robust to the misspecification of distributions required for GPS methods, a doubly robust estimator that is robust to the misspecification of models, and a variable selection technique for causal inference that ensures an unbiased and statistically efficient estimation.
Simulation studies showed that GOALDeR was able to generate nearly unbiased estimates when either the GPS model or the outcome model was correctly specified. Notably, GOALDeR demonstrated greater precision and accuracy compared to existing methods and was slightly affected by the covariate correlation structure and ratio of sample size to covariate dimension. Real data analysis revealed no statistically significant dose-response relationship between epigenetic age acceleration and Alzheimer's disease.
In this study, we proposed GOALDeR as an advanced GPS method for causal inference in high dimensions, and empirically demonstrated that GOALDeR is doubly robust, with improved accuracy and precision compared to existing methods. The R package is available at https://github.com/QianGao-SXMU/GOALDeR .
在具有丰富协变量信息的观察性研究中,广义倾向得分(GPS)方法已广泛用于估计连续治疗与结果之间的因果关系。丰富协变量的存在增强了无混杂假设的合理性。尽管如此,除了无混杂假设之外,确保正确指定边际和条件治疗分布也至关重要。
我们通过将基于平衡的方法扩展到高维并引入广义结果自适应套索和双重稳健估计(GOALDeR)来解决现有GPS方法的局限性。这种新方法整合了一种对GPS方法所需分布的错误指定具有鲁棒性的基于平衡的方法、一种对模型的错误指定具有鲁棒性的双重稳健估计器,以及一种用于因果推断的变量选择技术,可确保无偏且统计高效的估计。
模拟研究表明,当GPS模型或结果模型正确指定时,GOALDeR能够生成几乎无偏的估计。值得注意的是,与现有方法相比,GOALDeR具有更高的精度和准确性,并且受协变量相关结构以及样本量与协变量维度之比的影响较小。实际数据分析显示,表观遗传年龄加速与阿尔茨海默病之间不存在统计学上显著的剂量反应关系。
在本研究中,我们提出GOALDeR作为一种用于高维因果推断的先进GPS方法,并通过实证证明GOALDeR具有双重稳健性,与现有方法相比,其准确性和精度有所提高。R包可在https://github.com/QianGao-SXMU/GOALDeR获取。