Biostatistics and Biomathematics Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.
Genet Epidemiol. 2012 Apr;36(3):183-94. doi: 10.1002/gepi.21610.
Identifying gene and environment interaction (G × E) can provide insights into biological networks of complex diseases, identify novel genes that act synergistically with environmental factors, and inform risk prediction. However, despite the fact that hundreds of novel disease-associated loci have been identified from genome-wide association studies (GWAS), few G × Es have been discovered. One reason is that most studies are underpowered for detecting these interactions. Several new methods have been proposed to improve power for G × E analysis, but performance varies with scenario. In this article, we present a module-based approach to integrating various methods that exploits each method's most appealing aspects. There are three modules in our approach: (1) a screening module for prioritizing Single Nucleotide Polymorphisms (SNPs); (2) a multiple comparison module for testing G × E; and (3) a G × E testing module. We combine all three of these modules and develop two novel "cocktail" methods. We demonstrate that the proposed cocktail methods maintain the type I error, and that the power tracks well with the best existing methods, despite that the best methods may be different under various scenarios and interaction models. For GWAS, where the true interaction models are unknown, methods like our "cocktail" methods that are powerful under a wide range of situations are particularly appealing. Broadly speaking, the modular approach is conceptually straightforward and computationally simple. It builds on common test statistics and is easily implemented without additional computational efforts. It also allows for an easy incorporation of new methods as they are developed. Our work provides a comprehensive and powerful tool for devising effective strategies for genome-wide detection of gene-environment interactions.
识别基因与环境相互作用(G×E)可以深入了解复杂疾病的生物学网络,发现与环境因素协同作用的新基因,并为风险预测提供信息。然而,尽管从全基因组关联研究(GWAS)中已经确定了数百个新的与疾病相关的基因座,但很少发现 G×E。原因之一是大多数研究在检测这些相互作用方面的能力不足。已经提出了几种新的方法来提高 G×E 分析的功效,但性能因情况而异。在本文中,我们提出了一种基于模块的方法来整合各种方法,利用每种方法最吸引人的方面。我们的方法有三个模块:(1)用于优先筛选单核苷酸多态性(SNP)的筛选模块;(2)用于测试 G×E 的多重比较模块;(3)用于测试 G×E 的模块。我们将这三个模块结合起来,开发了两种新的“鸡尾酒”方法。我们证明,所提出的鸡尾酒方法保持了第一类错误,并且尽管在不同的情况下和交互模型下最佳方法可能不同,但功效与现有的最佳方法很好地吻合。对于 GWAS,由于真正的交互模型未知,因此像我们的“鸡尾酒”方法这样在广泛情况下具有强大功效的方法特别有吸引力。广义上讲,模块化方法在概念上简单明了,计算上也很简单。它建立在常见的检验统计量的基础上,无需额外的计算工作即可轻松实现。它还允许轻松合并新方法,因为它们是开发的。我们的工作为设计有效的全基因组检测基因-环境相互作用的策略提供了一个全面而强大的工具。