Sodoyer R, Damotte M, Delovitch T L, Trucy J, Jordan B R, Strachan T
EMBO J. 1984 Apr;3(4):879-85. doi: 10.1002/j.1460-2075.1984.tb01900.x.
The HLA-CW3 gene contained in a cosmid clone identified by transfection expression experiments has been completely sequenced. This provides, for the first time, data on the structure of HLA-C locus products and constitutes, together with that of the gene coding for HLA-A3, the first complete nucleotide sequences of genes coding for serologically defined class I HLA molecules. In contrast to the organisation of the two class I HLA pseudogenes whose sequences have previously been determined, the sequence of the HLA-CW3 gene reveals an additional cytoplasmic encoding domain, making the organisation of this gene very similar to that of known H-2 class I genes and also the HLA-A3 gene. The deduced amino acid sequences of HLA-CW3 and HLA-A3 now allow a systematic comparison of such sequences of HLA class I molecules from the three classical transplantation antigen loci A, B, C. The compared sequences include the previously determined partial amino acid sequences of HLA-B7, HLA-B40, HLA-A2 and HLA-A28. The comparisons confirm the extreme polymorphism of HLA classical class I molecules, and permit a study of the level of diversity and the location of sequence differences. The distribution of differences is not uniform, most of them being located in the first and second extracellular domains, the third extracellular domain is extremely conserved, and the cytoplasmic domain is also a variable region. Although it is difficult to determine locus-specific regions, we have identified several candidate positions which may be C locus-specific.
通过转染表达实验鉴定的黏粒克隆中所含的HLA - CW3基因已被完全测序。这首次提供了关于HLA - C基因座产物结构的数据,并与编码HLA - A3的基因序列一起,构成了编码血清学定义的I类HLA分子的基因的首批完整核苷酸序列。与之前已确定序列的两个I类HLA假基因的结构不同,HLA - CW3基因的序列揭示了一个额外的胞质编码结构域,使得该基因的结构与已知的H - 2 I类基因以及HLA - A3基因非常相似。HLA - CW3和HLA - A3推导的氨基酸序列现在允许对来自三个经典移植抗原基因座A、B、C的I类HLA分子的此类序列进行系统比较。比较的序列包括先前确定的HLA - B7、HLA - B40、HLA - A2和HLA - A28的部分氨基酸序列。这些比较证实了HLA经典I类分子的极端多态性,并允许研究多样性水平和序列差异的位置。差异的分布并不均匀,大多数差异位于第一和第二细胞外结构域,第三细胞外结构域极其保守,胞质结构域也是一个可变区域。尽管难以确定基因座特异性区域,但我们已经确定了几个可能是C基因座特异性的候选位置。