Li Mingyao, Li Chun
Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA.
Genet Epidemiol. 2008 Nov;32(7):589-99. doi: 10.1002/gepi.20335.
Assessing Hardy-Weinberg equilibrium (HWE) is often employed as an important initial step for genotype data quality checking in genetics studies. Tests for HWE often assume that the genotypes are randomly sampled from the general population. However, in many human genetics studies, subjects are ascertained through their disease status, and affected individuals (and their relatives in family-based studies) are overly represented in the ascertained sample than in the general population. As a result, when a marker is associated with the disease, the type I error rate in the HWE tests can be inflated, leading to false exclusion of associated markers from future analysis. Here we develop a general likelihood framework that allows assessment of departure from HWE while taking into account potential association with the disease. Our method can differentiate HWE departure caused by disease association from departure caused by other reasons, such as genotyping errors. The framework can be used for various data structures, including unrelated cases and controls, nuclear families with one or more offspring, or a mixture of them. The type I error rate of our test is under control for a broad range of scenarios. For case-control data, compared to the traditional HWE test that uses only controls, our test is more powerful to detect HWE departure for common diseases and has comparable power for rare diseases. For case-parents trios, our test is more powerful than the traditional HWE test that uses parents only.
评估哈迪-温伯格平衡(HWE)通常被用作遗传学研究中基因型数据质量检查的重要初始步骤。HWE检验通常假定基因型是从一般人群中随机抽样的。然而,在许多人类遗传学研究中,研究对象是通过其疾病状态确定的,与一般人群相比,患病个体(以及基于家系研究中的他们的亲属)在确定的样本中所占比例过高。因此,当一个标记与疾病相关时,HWE检验中的I型错误率可能会膨胀,导致相关标记在未来分析中被错误排除。在此,我们开发了一个通用似然框架,该框架在考虑与疾病潜在关联的同时,能够评估偏离HWE的情况。我们的方法能够区分由疾病关联导致的HWE偏离和由其他原因(如基因分型错误)导致的偏离。该框架可用于各种数据结构,包括无关的病例和对照、有一个或多个后代的核心家庭,或它们的混合。在广泛的场景下,我们检验的I型错误率都在可控范围内。对于病例对照数据,与仅使用对照的传统HWE检验相比,我们的检验在检测常见疾病的HWE偏离方面更具效力,而在检测罕见疾病方面具有相当的效力。对于病例-父母三联体,我们的检验比仅使用父母的传统HWE检验更具效力。