Evans David M, Marchini Jonathan, Morris Andrew P, Cardon Lon R
Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom.
PLoS Genet. 2006 Sep 22;2(9):e157. doi: 10.1371/journal.pgen.0020157.
Studies in model organisms suggest that epistasis may play an important role in the etiology of complex diseases and traits in humans. With the era of large-scale genome-wide association studies fast approaching, it is important to quantify whether it will be possible to detect interacting loci using realistic sample sizes in humans and to what extent undetected epistasis will adversely affect power to detect association when single-locus approaches are employed. We therefore investigated the power to detect association for an extensive range of two-locus quantitative trait models that incorporated varying degrees of epistasis. We compared the power to detect association using a single-locus model that ignored interaction effects, a full two-locus model that allowed for interactions, and, most important, two two-stage strategies whereby a subset of loci initially identified using single-locus tests were analyzed using the full two-locus model. Despite the penalty introduced by multiple testing, fitting the full two-locus model performed better than single-locus tests for many of the situations considered, particularly when compared with attempts to detect both individual loci. Using a two-stage strategy reduced the computational burden associated with performing an exhaustive two-locus search across the genome but was not as powerful as the exhaustive search when loci interacted. Two-stage approaches also increased the risk of missing interacting loci that contributed little effect at the margins. Based on our extensive simulations, our results suggest that an exhaustive search involving all pairwise combinations of markers across the genome might provide a useful complement to single-locus scans in identifying interacting loci that contribute to moderate proportions of the phenotypic variance.
对模式生物的研究表明,上位性可能在人类复杂疾病和性状的病因学中发挥重要作用。随着大规模全基因组关联研究时代的快速临近,量化在人类中使用实际样本量检测相互作用基因座是否可行,以及当采用单基因座方法时未检测到的上位性会在多大程度上对检测关联的效能产生不利影响,这一点很重要。因此,我们研究了一系列包含不同程度上位性的双基因座数量性状模型检测关联的效能。我们比较了使用忽略相互作用效应的单基因座模型、允许相互作用的完整双基因座模型,以及最重要的两种两阶段策略检测关联的效能,其中一种两阶段策略是对最初使用单基因座检验鉴定出的一部分基因座,使用完整双基因座模型进行分析。尽管多重检验会带来惩罚,但在许多考虑的情况下,拟合完整双基因座模型的表现优于单基因座检验,特别是与检测单个基因座的尝试相比时。使用两阶段策略减轻了在全基因组中进行详尽双基因座搜索的计算负担,但当基因座相互作用时,其效能不如详尽搜索。两阶段方法还增加了遗漏在边缘效应较小的相互作用基因座的风险。基于我们广泛的模拟,我们的结果表明,涉及全基因组标记所有成对组合的详尽搜索,可能为单基因座扫描提供有用补充,以识别对表型变异有适度贡献比例的相互作用基因座。