Wigginton Janis E, Cutler David J, Abecasis Goncalo R
Center for Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA.
Am J Hum Genet. 2005 May;76(5):887-93. doi: 10.1086/429864. Epub 2005 Mar 23.
Deviations from Hardy-Weinberg equilibrium (HWE) can indicate inbreeding, population stratification, and even problems in genotyping. In samples of affected individuals, these deviations can also provide evidence for association. Tests of HWE are commonly performed using a simple chi2 goodness-of-fit test. We show that this chi2 test can have inflated type I error rates, even in relatively large samples (e.g., samples of 1,000 individuals that include approximately 100 copies of the minor allele). On the basis of previous work, we describe exact tests of HWE together with efficient computational methods for their implementation. Our methods adequately control type I error in large and small samples and are computationally efficient. They have been implemented in freely available code that will be useful for quality assessment of genotype data and for the detection of genetic association or population stratification in very large data sets.
偏离哈迪-温伯格平衡(HWE)可能表明存在近亲繁殖、群体分层,甚至基因分型问题。在受影响个体的样本中,这些偏差也可为关联提供证据。HWE检验通常使用简单的卡方拟合优度检验来进行。我们表明,即使在相对大的样本中(例如,包含约100个次要等位基因拷贝的1000个个体的样本),这种卡方检验的I型错误率也可能会膨胀。基于先前的工作,我们描述了HWE的精确检验及其实施的高效计算方法。我们的方法在大样本和小样本中都能充分控制I型错误,并且计算效率高。它们已在免费提供的代码中实现,这将有助于对基因型数据进行质量评估,并在非常大的数据集中检测基因关联或群体分层。