Chen Dong, Wang Yuquan, Shi Dapeng, Cao Yunlong, Hu Yue-Qing
State Key Laboratory of Genetic Engineering, Human Phenome Institute, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai, China.
Shanghai Center for Mathematical Sciences, Fudan University, Shanghai, China.
Stat Med. 2024 Dec 30;43(30):5814-5836. doi: 10.1002/sim.10269. Epub 2024 Nov 18.
The instrumental variable method is widely used in causal inference research to improve the accuracy of estimating causal effects. However, the weak correlation between instruments and exposure, as well as the direct impact of instruments on the outcome, can lead to biased estimates. To mitigate the bias introduced by such instruments in nonlinear causal inference, we propose a two-stage nonlinear causal effect estimation based on model averaging. The model uses different subsets of instruments in the first stage to predict exposure after a nonlinear transformation with the help of sliced inverse regression. In the second stage, adaptive Lasso penalty is applied to instruments to obtain the estimation of causal effect. We prove that the proposed estimator exhibits favorable asymptotic properties and evaluate its performance through a series of numerical studies, demonstrating its effectiveness in identifying nonlinear causal effects and its capability to handle scenarios with weak and invalid instruments. We apply the proposed method to the Atherosclerosis Risk in Communities dataset to investigate the relationship between BMI and hypertension.
工具变量法在因果推断研究中被广泛应用,以提高因果效应估计的准确性。然而,工具变量与暴露之间的弱相关性以及工具变量对结果的直接影响,可能导致估计偏差。为了减轻此类工具变量在非线性因果推断中引入的偏差,我们提出了一种基于模型平均的两阶段非线性因果效应估计方法。该模型在第一阶段使用不同的工具变量子集,借助切片逆回归对暴露进行非线性变换后预测暴露。在第二阶段,对工具变量应用自适应Lasso惩罚以获得因果效应估计。我们证明了所提出的估计量具有良好的渐近性质,并通过一系列数值研究评估了其性能,证明了其在识别非线性因果效应方面的有效性以及处理弱工具变量和无效工具变量情况的能力。我们将所提出的方法应用于社区动脉粥样硬化风险数据集,以研究体重指数与高血压之间的关系。