Department of Pathology, Stanford University School of Medicine, Palo Alto, CA, USA; Histocompatibility, Immunogenetics and Disease Profiling Laboratory, Stanford Blood Center, Palo Alto, CA, USA.
Histocompatibility, Immunogenetics and Disease Profiling Laboratory, Stanford Blood Center, Palo Alto, CA, USA.
Hum Immunol. 2019 Oct;80(10):807-822. doi: 10.1016/j.humimm.2019.07.275. Epub 2019 Jul 22.
The human leukocyte antigen (HLA) genes are extremely polymorphic and are useful molecular markers to make inferences about human population history. However, the accuracy of the estimation of genetic diversity at HLA loci very much depends on the technology used to characterize HLA alleles; high-resolution genotyping of long-range HLA gene products improves the assessment of HLA population diversity as well as other population parameters compared to lower resolution typing methods. In this study we examined allelic and haplotype HLA diversity in a large healthy European American population sourced from the UCSF-DNA bank. A high-resolution next-generation sequencing method was applied to define non-ambiguous 3- and 4-field alleles at the HLA-A, HLA-C, HLA-B, HLA-DRB1, HLA-DRB3/4/5, HLA-DQA1, HLA-DQB1, HLA-DPA1, and HLA-DPB1 loci in samples provided by 2248 unrelated individuals. A number of population parameters were examined including balancing selection and various measurements of linkage disequilibrium were calculated. There were no detectable deviations from Hardy-Weinberg proportions at HLA-A, HLA-DRB1, HLA-DQA1 and HLA-DQB1. For the remaining loci moderate and significant deviations were detected at HLA-C, HLA-B, HLA-DRB3/4/5, HLA-DPA1 and HLA-DPB1 loci mostly from population substructures. Unique 4-field associations were observed among alleles at 2 loci and haplotypes extending large intervals that were not apparent in results obtained using testing methodologies with limited sequence coverage and phasing. The high diversity at HLA-DPA1 results from detection of intron variants of otherwise well conserved protein sequences. It may be speculated that divergence in exon sequences may be negatively selected. Our data provides a valuable reference source for future population studies that may allow for precise fine mapping of coding and non-coding sequences determining disease susceptibility and allo-immunogenicity.
人类白细胞抗原 (HLA) 基因具有极高的多态性,是推断人类群体历史的有用分子标记。然而,HLA 基因座遗传多样性的估计准确性在很大程度上取决于用于描述 HLA 等位基因的技术;与低分辨率分型方法相比,长距离 HLA 基因产物的高分辨率基因分型可提高 HLA 群体多样性以及其他群体参数的评估。在这项研究中,我们检查了来自 UCSF-DNA 银行的大型健康欧洲裔美国人人群中的等位基因和单倍型 HLA 多样性。应用高分辨率下一代测序方法在 2248 名无关个体提供的样本中定义 HLA-A、HLA-C、HLA-B、HLA-DRB1、HLA-DRB3/4/5、HLA-DQA1、HLA-DQB1、HLA-DPA1 和 HLA-DPB1 基因座的非歧义 3 位和 4 位等位基因。检查了许多群体参数,包括平衡选择和计算了各种连锁不平衡测量值。在 HLA-A、HLA-DRB1、HLA-DQA1 和 HLA-DQB1 基因座未检测到 Hardy-Weinberg 比例的偏差。对于其余的基因座,在 HLA-C、HLA-B、HLA-DRB3/4/5、HLA-DPA1 和 HLA-DPB1 基因座检测到中等和显著的偏差,主要是由于群体亚结构造成的。在 2 个基因座的等位基因和扩展到使用具有有限序列覆盖和相位的测试方法无法明显观察到的大间隔的单倍型中观察到独特的 4 位关联。HLA-DPA1 的高度多样性源于对其他情况下高度保守的蛋白质序列的内含子变体的检测。可以推测,外显子序列的差异可能受到负选择。我们的数据为未来的群体研究提供了有价值的参考资料,这可能允许对决定疾病易感性和同种免疫原性的编码和非编码序列进行精确的精细作图。