Ko Yi-An, Mukherjee Bhramar, Smith Jennifer A, Park Sung Kyun, Kardia Sharon L R, Allison Matthew A, Vokonas Pantel S, Chen Jinbo, Diez-Roux Ana V
Department of Biostatistics, University of Michigan, Ann Arbor, MI, 48109, U.S.A.
Stat Med. 2014 Dec 20;33(29):5177-91. doi: 10.1002/sim.6281. Epub 2014 Aug 11.
While there has been extensive research developing gene-environment interaction (GEI) methods in case-control studies, little attention has been given to sparse and efficient modeling of GEI in longitudinal studies. In a two-way table for GEI with rows and columns as categorical variables, a conventional saturated interaction model involves estimation of a specific parameter for each cell, with constraints ensuring identifiability. The estimates are unbiased but are potentially inefficient because the number of parameters to be estimated can grow quickly with increasing categories of row/column factors. On the other hand, Tukey's one-degree-of-freedom model for non-additivity treats the interaction term as a scaled product of row and column main effects. Because of the parsimonious form of interaction, the interaction estimate leads to enhanced efficiency, and the corresponding test could lead to increased power. Unfortunately, Tukey's model gives biased estimates and low power if the model is misspecified. When screening multiple GEIs where each genetic and environmental marker may exhibit a distinct interaction pattern, a robust estimator for interaction is important for GEI detection. We propose a shrinkage estimator for interaction effects that combines estimates from both Tukey's and saturated interaction models and use the corresponding Wald test for testing interaction in a longitudinal setting. The proposed estimator is robust to misspecification of interaction structure. We illustrate the proposed methods using two longitudinal studies-the Normative Aging Study and the Multi-ethnic Study of Atherosclerosis.
虽然在病例对照研究中已经有大量关于开发基因 - 环境相互作用(GEI)方法的研究,但纵向研究中GEI的稀疏且高效建模却很少受到关注。在一个以行为类别变量、列为类别变量的GEI双向表中,传统的饱和相互作用模型涉及为每个单元格估计一个特定参数,并通过约束确保可识别性。这些估计是无偏的,但可能效率不高,因为随着行/列因素类别的增加,要估计的参数数量会迅速增长。另一方面,用于非加性的Tukey单自由度模型将相互作用项视为行和列主效应的缩放乘积。由于相互作用的简约形式,相互作用估计提高了效率,相应的检验可能会提高检验效能。不幸的是,如果模型设定错误,Tukey模型会给出有偏估计且检验效能较低。当筛选多个GEI时,其中每个遗传和环境标记可能呈现出不同的相互作用模式,一个稳健的相互作用估计器对于GEI检测很重要。我们提出了一种用于相互作用效应的收缩估计器,它结合了Tukey模型和饱和相互作用模型的估计,并在纵向环境中使用相应的Wald检验来检验相互作用。所提出的估计器对相互作用结构的错误设定具有稳健性。我们使用两项纵向研究——规范老化研究和动脉粥样硬化多族裔研究来说明所提出的方法。