Robinson James, Guethlein Lisbeth A, Cereb Nezih, Yang Soo Young, Norman Paul J, Marsh Steven G E, Parham Peter
Anthony Nolan Research Institute, London, United Kingdom.
UCL Cancer Institute, University College London, London, United Kingdom.
PLoS Genet. 2017 Jun 26;13(6):e1006862. doi: 10.1371/journal.pgen.1006862. eCollection 2017 Jun.
HLA class I glycoproteins contain the functional sites that bind peptide antigens and engage lymphocyte receptors. Recently, clinical application of sequence-based HLA typing has uncovered an unprecedented number of novel HLA class I alleles. Here we define the nature and extent of the variation in 3,489 HLA-A, 4,356 HLA-B and 3,111 HLA-C alleles. This analysis required development of suites of methods, having general applicability, for comparing and analyzing large numbers of homologous sequences. At least three amino-acid substitutions are present at every position in the polymorphic α1 and α2 domains of HLA-A, -B and -C. A minority of positions have an incidence >1% for the 'second' most frequent nucleotide, comprising 70 positions in HLA-A, 85 in HLA-B and 54 in HLA-C. The majority of these positions have three or four alternative nucleotides. These positions were subject to positive selection and correspond to binding sites for peptides and receptors. Most alleles of HLA class I (>80%) are very rare, often identified in one person or family, and they differ by point mutation from older, more common alleles. These alleles with single nucleotide polymorphisms reflect the germ-line mutation rate. Their frequency predicts the human population harbors 8-9 million HLA class I variants. The common alleles of human populations comprise 42 core alleles, which represent all selected polymorphism, and recombinants that have assorted this polymorphism.
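The criterion used in the abstract, positions where the 'second' most frequent nucleotide has an incidence >1%, can be illustrated with a minimal sketch. This is not the authors' method or code: the function name `second_nucleotide_positions`, the toy alignment, and the choice to measure incidence as a fraction of aligned allele sequences with a called base (A, C, G or T) at that column are all assumptions made for illustration.

```python
from collections import Counter


def second_nucleotide_positions(aligned_seqs, threshold=0.01):
    """Return 0-based columns whose second most frequent nucleotide
    exceeds `threshold` (hypothetical helper, not from the paper)."""
    if not aligned_seqs:
        return []
    length = len(aligned_seqs[0])
    if any(len(seq) != length for seq in aligned_seqs):
        raise ValueError("sequences must be aligned to equal length")

    hits = []
    for col in range(length):
        # Count only called bases; gaps and ambiguity codes are ignored (assumption).
        counts = Counter(seq[col] for seq in aligned_seqs if seq[col] in "ACGT")
        total = sum(counts.values())
        ranked = counts.most_common(2)
        # A column qualifies only if a second nucleotide exists and is frequent enough.
        if total and len(ranked) == 2 and ranked[1][1] / total > threshold:
            hits.append(col)
    return hits


# Toy alignment of 200 sequences (invented data, for illustration only):
# column 1 carries G in 3 sequences (1.5%), column 2 carries C in 1 (0.5%).
toy = ["AAA"] * 196 + ["AGA"] * 3 + ["AAC"] * 1
print(second_nucleotide_positions(toy))  # -> [1]
```

In the toy data only column 1 passes the 1% cut-off; column 2 is the kind of position carried by a single rare allele, which the abstract attributes to point mutation from more common alleles.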