Department of Preventive Medicine, University of Southern California, Los Angeles, California 90089-9010, USA.
Genet Epidemiol. 2011 Apr;35(3):201-10. doi: 10.1002/gepi.20569. Epub 2011 Feb 9.
Many complex diseases are likely to be a result of the interplay of genes and environmental exposures. The standard analysis in a genome-wide association study (GWAS) scans for main effects and ignores the potentially useful information in the available exposure data. Two recently proposed methods that exploit environmental exposure information involve a two-step analysis aimed at prioritizing the large number of SNPs tested to highlight those most likely to be involved in a GE interaction. For example, Murcray et al. ([2009] Am J Epidemiol 169:219–226) proposed screening on a test that models the G-E association induced by an interaction in the combined case-control sample. Alternatively, Kooperberg and LeBlanc ([2008] Genet Epidemiol 32:255–263) suggested screening on genetic marginal effects. In both methods, SNPs that pass the respective screening step at a pre-specified significance threshold are followed up with a formal test of interaction in the second step. We propose a hybrid method that combines these two screening approaches by allocating a proportion of the overall genomewide significance level to each test. We show that the Murcray et al. approach is often the most efficient method, but that the hybrid approach is a powerful and robust method for nearly any underlying model. As an example, for a GWAS of 1 million markers including a single true disease SNP with minor allele frequency of 0.15, and a binary exposure with prevalence 0.3, the Murcray, Kooperberg and hybrid methods are 1.90, 1.27, and 1.87 times as efficient, respectively, as the traditional case-control analysis to detect an interaction effect size of 2.0.
许多复杂疾病可能是基因与环境暴露相互作用的结果。全基因组关联研究(GWAS)中的标准分析扫描主要效应,忽略了现有暴露数据中有用的信息。最近提出的两种利用环境暴露信息的方法涉及两步分析,旨在优先对大量测试的 SNP 进行排序,以突出那些最有可能参与基因-环境相互作用的 SNP。例如,Murcray 等人(2009 年,《美国流行病学杂志》169:219-226)提出在测试中筛选,该测试模拟了在病例对照组合样本中相互作用引起的 G-E 关联。或者,Kooperberg 和 LeBlanc(2008 年,《遗传流行病学》32:255-263)建议在遗传边际效应上进行筛选。在这两种方法中,通过在预先指定的显著阈值下通过各自的筛选步骤的 SNP 会在第二步中进行交互作用的正式测试。我们提出了一种混合方法,通过将总全基因组显著性水平的一部分分配给每个测试,从而结合了这两种筛选方法。我们表明,Murcray 等人的方法通常是最有效的方法,但混合方法是几乎任何基础模型的强大而稳健的方法。例如,对于包含 100 万个标记的 GWAS,包括一个等位基因频率为 0.15 的真实疾病 SNP 和一个患病率为 0.3 的二进制暴露,Murcray、Kooperberg 和混合方法的效率分别是传统病例对照分析的 1.90、1.27 和 1.87 倍,以检测 2.0 的相互作用效应大小。