Greene Casey S, Himmelstein Daniel S, Nelson Heather H, Kelsey Karl T, Williams Scott M, Andrew Angeline S, Karagas Margaret R, Moore Jason H
Department of Genetics, Dartmouth Medical School, Lebanon, NH 03756, USA.
Pac Symp Biocomput. 2010:327-36. doi: 10.1142/9789814295291_0035.
One goal of personal genomics is to use information about genomic variation to predict who is at risk for various common diseases. Technological advances in genotyping have spawned several personal genetic testing services that market genotyping services directly to the consumer. An important goal of consumer genetic testing is to provide health information along with the genotyping results. This has the potential to integrate detailed personal genetic and genomic information into healthcare decision making. Despite the potential importance of these advances, there are some important limitations. One concern is that much of the literature that is used to formulate personal genetics reports is based on genetic association studies that consider each genetic variant independently of the others. It is our working hypothesis that the true value of personal genomics will only be realized when the complexity of the genotype-to-phenotype mapping relationship is embraced, rather than ignored. We focus here on complexity in genetic architecture due to epistasis or nonlinear gene-gene interaction. We have previously developed a multifactor dimensionality reduction (MDR) algorithm and software package for detecting nonlinear interactions in genetic association studies. In most prior MDR analyses, the permutation testing strategy used to assess statistical significance was unable to differentiate MDR models that captured only interaction effects from those that also detected independent main effects. Statistical interpretation of MDR models required post-hoc analysis using entropy-based measures of interaction information. We introduce here a novel permutation test that allows the effects of nonlinear interactions between multiple genetic variants to be specifically tested in a manner that is not confounded by linear additive effects. We show using simulated nonlinear interactions that the power using the explicit test of epistasis is no different than a standard permutation test. We also show that the test has the appropriate size or type I error rate of approximately 0.05. We then apply MDR with the new explicit test of epistasis to a large genetic study of bladder cancer and show that a previously reported nonlinear interaction between is indeed significant, even after considering the strong additive effect of smoking in the model. Finally, we evaluated the power of the explicit test of epistasis to detect the nonlinear interaction between two XPD gene polymorphisms by simulating data from the MDR model of bladder cancer susceptibility. The results of this study provide for the first time a simple method for explicitly testing epistasis or gene-gene interaction effects in genetic association studies. Although we demonstrated the method with MDR, an important advantage is that it can be combined with any modeling approach. The explicit test of epistasis brings us a step closer to the type of routine gene-gene interaction analysis that is needed if we are to enable personal genomics.
个人基因组学的一个目标是利用基因组变异信息来预测谁有患各种常见疾病的风险。基因分型技术的进步催生了几种直接向消费者推销基因分型服务的个人基因检测服务。消费者基因检测的一个重要目标是在提供基因分型结果的同时提供健康信息。这有可能将详细的个人遗传和基因组信息整合到医疗决策中。尽管这些进展具有潜在的重要性,但也存在一些重要的局限性。一个担忧是,用于制定个人基因报告的许多文献是基于基因关联研究,这些研究独立地考虑每个基因变异。我们的工作假设是,只有当基因型与表型映射关系的复杂性得到认可而非被忽视时,个人基因组学的真正价值才能实现。我们在此关注由于上位性或非线性基因 - 基因相互作用导致的遗传结构复杂性。我们之前开发了一种多因素降维(MDR)算法和软件包,用于在基因关联研究中检测非线性相互作用。在大多数先前的MDR分析中,用于评估统计显著性的置换检验策略无法区分仅捕获相互作用效应的MDR模型与那些还检测到独立主效应的模型。MDR模型的统计解释需要使用基于熵的相互作用信息度量进行事后分析。我们在此介绍一种新颖的置换检验,该检验允许以一种不受线性加性效应混淆的方式专门测试多个基因变异之间的非线性相互作用的效应。我们使用模拟的非线性相互作用表明,使用明确的上位性检验的功效与标准置换检验无异。我们还表明该检验具有适当的大小或约为0.05的I型错误率。然后,我们将带有新的明确上位性检验的MDR应用于一项膀胱癌的大型基因研究,并表明即使在模型中考虑了吸烟的强烈加性效应之后,先前报道的非线性相互作用确实是显著的。最后,我们通过模拟膀胱癌易感性的MDR模型数据,评估了明确的上位性检验检测两个XPD基因多态性之间非线性相互作用的功效。这项研究的结果首次提供了一种在基因关联研究中明确测试上位性或基因 - 基因相互作用效应的简单方法。尽管我们用MDR演示了该方法,但一个重要的优点是它可以与任何建模方法相结合。明确的上位性检验使我们更接近实现个人基因组学所需的常规基因 - 基因相互作用分析类型。