Department of Neurology, University of California, San Francisco, San Francisco, CA, United States.
Division of Biomedical Informatics and Personalized Medicine, Department of Immunology, University of Colorado, Denver, CO, United States.
Front Immunol. 2019 May 17;10:989. doi: 10.3389/fimmu.2019.00989. eCollection 2019.
The () region comprises a fast-evolving family of genes that encode receptors for natural killer (NK) cells and have crucial role in host defense. Evolution of was examined in the context of the human genome. Gene-content diversity and single nucleotide polymorphisms (SNP) in the genes and flanking regions were compared to >660,000 genome-wide SNPs in over 800 individuals from 52 populations of the human genome diversity panel (HGDP). allelic diversity was further examined using next generation sequencing in a subset of 56 individuals. We identified the SNP located in as a marker of and and, consequently, Cen A and Cen B haplotypes. We also show that combinations of two SNPs ( and ) distinguish from and also define the major high- and low-expressing alleles lineages. Comparing the diversity of the SNPs within the region to remainder of the genome, we observed a high diversity for the centromeric region consistent with balancing selection ( < 0.01); in contrast, centromeric diversity is significantly reduced in East Asian populations ( < 0.01), indicating purifying selection. By analyzing SNP haplotypes in a region spanning ~500 kb that includes the cluster, we observed evidence of strong positive selection in Africa for high-expressing alleles, favored over the low-expressing alleles ( < 0.01). In sharp contrast, the strong positive selection ( < 0.01) that we also observed in the telomeric region in Oceanic populations tracked with a high frequency of . In addition, we demonstrated that worldwide frequency of high-expression alleles was correlated with virus with virus (r = 0.64, < 10) and protozoa (r = 0.69, < 10) loads, which points to selection globally on high-expressing alleles attributable to pathogen exposure.
该()区域包含一个快速进化的基因家族,这些基因编码自然杀伤 (NK) 细胞的受体,在宿主防御中起着至关重要的作用。在人类基因组的背景下研究了的进化。将基因和侧翼区域中的基因含量多样性和单核苷酸多态性 (SNP) 与来自人类基因组多样性面板 (HGDP) 的 52 个人群中超过 800 个人的 >660,000 个全基因组 SNP 进行了比较。使用下一代测序在 56 个人的亚集中进一步检查了的等位基因多样性。我们确定了位于中的 SNP 作为和的标记,因此,Cen A 和 Cen B 单倍型。我们还表明,两个 SNP(和)的组合可将与和区分开来,还定义了主要的高表达和低表达等位基因谱系。将 SNP 内的多样性与基因组其余部分进行比较,我们观察到着丝粒区域的多样性很高,与平衡选择一致(<0.01);相比之下,东亚人群的着丝粒区域的多样性显着降低(<0.01),表明纯化选择。通过分析跨越包括基因簇的~500 kb 区域的 SNP 单倍型,我们观察到在非洲强烈选择高表达的等位基因的证据,高表达的等位基因有利于低表达的等位基因(<0.01)。相比之下,我们在大洋洲人群中观察到的端粒区域的强烈正选择(<0.01)与高频率的跟踪,这表明对病原体暴露的全球选择。此外,我们证明了高表达的全球频率与病毒(r = 0.64,<10)和原生动物(r = 0.69,<10)的载量相关,这表明全球对归因于病原体暴露的高表达等位基因的选择。