Zhao Jinying, Jin Li, Xiong Momiao
Human Genetics Center, University of Texas Health Science Center at Houston, Houston, TX 77225, USA.
Am J Hum Genet. 2006 Nov;79(5):831-45. doi: 10.1086/508571. Epub 2006 Sep 21.
Despite the growing consensus on the importance of testing gene-gene interactions in genetic studies of complex diseases, the effect of gene-gene interactions has often been defined as a deviance from genetic additive effects, which is essentially treated as a residual term in genetic analysis and leads to low power in detecting the presence of interacting effects. To what extent the definition of gene-gene interaction at population level reflects the genes' biochemical or physiological interaction remains a mystery. In this article, we introduce a novel definition and a new measure of gene-gene interaction between two unlinked loci (or genes). We developed a general theory for studying linkage disequilibrium (LD) patterns in disease population under two-locus disease models. The properties of using the LD measure in a disease population as a function of the measure of gene-gene interaction between two unlinked loci were also investigated. We examined how interaction between two loci creates LD in a disease population and showed that the mathematical formulation of the new definition for gene-gene interaction between two loci was similar to that of the LD between two loci. This finding motived us to develop an LD-based statistic to detect gene-gene interaction between two unlinked loci. The null distribution and type I error rates of the LD-based statistic for testing gene-gene interaction were validated using extensive simulation studies. We found that the new test statistic was more powerful than the traditional logistic regression under three two-locus disease models and demonstrated that the power of the test statistic depends on the measure of gene-gene interaction. We also investigated the impact of using tagging SNPs for testing interaction on the power to detect interaction between two unlinked loci. Finally, to evaluate the performance of our new method, we applied the LD-based statistic to two published data sets. Our results showed that the P values of the LD-based statistic were smaller than those obtained by other approaches, including logistic regression models.
尽管在复杂疾病的基因研究中,对于检测基因 - 基因相互作用的重要性已达成越来越多的共识,但基因 - 基因相互作用的效应通常被定义为与基因加性效应的偏差,这在基因分析中基本上被视为一个残差项,导致检测相互作用效应存在时的功效较低。群体水平上基因 - 基因相互作用的定义在多大程度上反映了基因的生化或生理相互作用仍是一个谜。在本文中,我们引入了一种新的定义以及两个非连锁位点(或基因)之间基因 - 基因相互作用的新度量。我们针对双位点疾病模型开发了一种研究疾病群体中连锁不平衡(LD)模式的通用理论。还研究了在疾病群体中使用LD度量作为两个非连锁位点之间基因 - 基因相互作用度量的函数的性质。我们研究了两个位点之间的相互作用如何在疾病群体中产生LD,并表明两个位点之间基因 - 基因相互作用新定义的数学公式与两个位点之间LD的数学公式相似。这一发现促使我们开发一种基于LD的统计量来检测两个非连锁位点之间的基因 - 基因相互作用。通过广泛的模拟研究验证了用于检验基因 - 基因相互作用的基于LD的统计量的零分布和I型错误率。我们发现,在三种双位点疾病模型下,新的检验统计量比传统逻辑回归更有效,并证明检验统计量的功效取决于基因 - 基因相互作用的度量。我们还研究了使用标签单核苷酸多态性(tagging SNPs)进行相互作用检验对检测两个非连锁位点之间相互作用功效的影响。最后,为了评估我们新方法的性能,我们将基于LD的数据统计量应用于两个已发表的数据集。我们的结果表明,基于LD的统计量的P值小于通过其他方法(包括逻辑回归模型)获得的P值。