Vansteelandt Stijn, Vanderweele Tyler J, Robins James M
Department of Applied Mathematics and Computer Sciences Ghent University, 281 (S9) Krijgslaan, 9000 Ghent, Belgium.
J Am Stat Assoc. 2008 Dec 1;103(484):1693-1704. doi: 10.1198/016214508000001084.
A primary focus of an increasing number of scientific studies is to determine whether two exposures interact in the effect that they produce on an outcome of interest. Interaction is commonly assessed by fitting regression models in which the linear predictor includes the product between those exposures. When the main interest lies in the interaction, this approach is not entirely satisfactory because it is prone to (possibly severe) bias when the main exposure effects or the association between outcome and extraneous factors are misspecified. In this article, we therefore consider conditional mean models with identity or log link which postulate the statistical interaction in terms of a finite-dimensional parameter, but which are otherwise unspecified. We show that estimation of the interaction parameter is often not feasible in this model because it would require nonparametric estimation of auxiliary conditional expectations given high-dimensional variables. We thus consider 'multiply robust estimation' under a union model that assumes at least one of several working submodels holds. Our approach is novel in that it makes use of information on the joint distribution of the exposures conditional on the extraneous factors in making inferences about the interaction parameter of interest. In the special case of a randomized trial or a family-based genetic study in which the joint exposure distribution is known by design or by Mendelian inheritance, the resulting multiply robust procedure leads to asymptotically distribution-free tests of the null hypothesis of no interaction on an additive scale. We illustrate the methods via simulation and the analysis of a randomized follow-up study.
越来越多科学研究的一个主要重点是确定两种暴露因素在对感兴趣的结果产生的影响中是否相互作用。相互作用通常通过拟合回归模型来评估,其中线性预测变量包括这些暴露因素之间的乘积。当主要关注点在于相互作用时,这种方法并不完全令人满意,因为当主要暴露效应或结果与外部因素之间的关联被错误设定时,它容易产生(可能很严重的)偏差。因此,在本文中,我们考虑具有恒等或对数链接的条件均值模型,这些模型根据有限维参数假设统计相互作用,但在其他方面未作具体规定。我们表明,在这个模型中,相互作用参数的估计通常不可行,因为这需要对给定高维变量的辅助条件期望进行非参数估计。因此,我们在一个联合模型下考虑“多重稳健估计”,该模型假设几个工作子模型中至少有一个成立。我们的方法新颖之处在于,在推断感兴趣的相互作用参数时,它利用了在外部因素条件下暴露因素联合分布的信息。在随机试验或基于家系的基因研究的特殊情况下,其中联合暴露分布通过设计或孟德尔遗传已知,由此产生的多重稳健程序会导致在加性尺度上对无相互作用的零假设进行渐近无分布检验。我们通过模拟和对一项随机随访研究的分析来说明这些方法。