Chen Bowang, Cole John W, Grond-Ginsbach Caspar
Department of Biology, Southern University of Science and Technology, Shenzhen, China.
Department of Neurology, Baltimore VA Medical Center (VHA), University of Maryland School of Medicine, Baltimore, MD, United States.
Front Genet. 2017 Oct 31;8:167. doi: 10.3389/fgene.2017.00167. eCollection 2017.
Departure from Hardy Weinberg Equilibrium (HWE) may occur due to a variety of causes, including purifying selection, inbreeding, population substructure, copy number variation or genotyping error. We searched for specific characteristics of HWE-departure due to genotyping error. Genotypes of a random set of genetic variants were obtained from the Exome Aggregation Consortium (ExAC) database. Variants with <80% successful genotypes or with minor allele frequency (MAF) <1% were excluded. HWE-departure (d-HWE) was considered significant at < 10E-05 and classified as d-HWE with loss of heterozygosity (LoH d-HWE) or d-HWE with excess heterozygosity (gain of heterozygosity: GoH d-HWE). Missing genotypes, variant type (single nucleotide polymorphism (SNP) vs. insertion/deletion); MAF, standard deviation (SD) of MAF across populations (MAF-SD) and copy number variation were evaluated for association with HWE-departure. The study sample comprised 3,204 genotype distributions. HWE-departure was observed in 134 variants: LoH d-HWE in 41 (1.3%), GoH d-HWE in 93 (2.9%) variants. LoH d-HWE was more likely in variants located within deletion polymorphisms ( < 0.001) and in variants with higher MAF-SD ( = 0.0077). GoH d-HWE was associated with low genotyping rate, with variants of insertion/deletion type and with high MAF (all at < 0.001). In a sub-sample of 2,196 variants with genotyping rate >98%, LoH d-HWE was found in 29 (1.3%) variants, but no GoH d-HWE was detected. The findings of the non-random distribution of HWE-violating SNPs along the chromosome, the association with common deletion polymorphisms and indel-variant type, and the finding of excess heterozygotes in genomic regions that are prone to cross-hybridization were confirmed in a large sample of short variants from the 1,000 Genomes Project. We differentiated between two types of HWE-departure. GoH d-HWE was suggestive for genotyping error. LoH d-HWE, on the contrary, pointed to natural variabilities such as population substructure or common deletion polymorphisms.
哈迪-温伯格平衡(HWE)的偏离可能由多种原因引起,包括纯化选择、近亲繁殖、种群亚结构、拷贝数变异或基因分型错误。我们研究了由于基因分型错误导致的HWE偏离的特定特征。从外显子组聚合联盟(ExAC)数据库中获取了一组随机遗传变异的基因型。排除了成功基因型<80%或次要等位基因频率(MAF)<1%的变异。当P值<10E-05时,HWE偏离(d-HWE)被认为具有统计学意义,并被分类为杂合性缺失的d-HWE(LoH d-HWE)或杂合性增加的d-HWE(杂合性增加:GoH d-HWE)。评估了缺失基因型、变异类型(单核苷酸多态性(SNP)与插入/缺失)、MAF、各群体MAF的标准差(SD)(MAF-SD)和拷贝数变异与HWE偏离的相关性。研究样本包括3204个基因型分布。在134个变异中观察到HWE偏离:41个(1.3%)为LoH d-HWE,93个(2.9%)为GoH d-HWE。位于缺失多态性内的变异(P<0.001)和MAF-SD较高的变异(P = 0.0077)更可能出现LoH d-HWE。GoH d-HWE与低基因分型率、插入/缺失类型的变异和高MAF相关(所有P值均<0.001)。在基因分型率>98%的2196个变异的子样本中,发现29个(1.3%)变异存在LoH d-HWE,但未检测到GoH d-HWE。在来自千人基因组计划的大量短变异样本中,证实了违反HWE的SNP在染色体上的非随机分布、与常见缺失多态性和插入缺失变异类型的关联,以及在易于交叉杂交的基因组区域中存在过量杂合子的发现。我们区分了两种类型的HWE偏离。GoH d-HWE提示基因分型错误。相反,LoH d-HWE则指向自然变异,如种群亚结构或常见缺失多态性。