Yamazaki M, Tateno Y, Inoko H
BioScience Research Laboratory, Fujiya Co., Ltd., Kanagawa, Japan.
J Mol Evol. 1999 Mar;48(3):317-27. doi: 10.1007/pl00006475.
We previously sequenced two regions: one around the centromeric end of the HLA class I region and one at the boundary between class I and class III. In this paper we analyze these two regions, about 385 kb of sequence, and confirm, with a new line of evidence, that the following two pairs of genomic segments were duplicated during evolution: (i) a 43-kb segment including the HLA-B gene, which shows the highest polymorphism among the classical HLA class I (class Ia) loci, and a 40-kb segment including the HLA-C locus, which shows the lowest polymorphism; and (ii) a 52-kb segment including MICB (MHC class I chain-related gene B) and a 35-kb segment including MICA. We also found that repetitive elements such as SINEs, LINEs, and LTRs occupy as much as 47% of the nucleotides in this 385-kb region. This unusually high content of repetitive elements indicates that repeat-mediated rearrangements have occurred frequently in the evolutionary history of the HLA class Ia region. Analysis of the LINE composition within the two pairs of duplicated segments revealed that (i) the LINEs in these regions had been dispersed before both the duplication of the HLA-B and -C loci and the duplication of the MICB and MICA loci, and (ii) the divergence of the HLA-B and -C loci occurred before the duplication of the MICA and MICB loci. To find novel genes responsible for HLA class I-associated or other diseases, we performed a computational analysis using GenScan and GRAIL with GenBank's dbEST. As a result, at least five as yet uncharacterized genes were newly mapped to the HLA class I centromeric region studied. These novel genes should be analyzed further to determine their relationships to diseases associated with this region.
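As an illustration of the repeat-content figure in the abstract (47% of a 385-kb region is roughly 181 kb), the following minimal Python sketch shows one way such a fraction could be computed from repeat annotations. It is not the authors' method: the function name `covered_fraction`, the half-open coordinate convention, and the example intervals are all assumptions made for illustration.

```python
def covered_fraction(intervals, region_length):
    """Return the fraction of a region covered by the union of intervals.

    Intervals are assumed to be half-open (start, end) pairs in base pairs,
    relative to the start of the region. Overlapping annotations are
    counted only once.
    """
    covered = 0
    current_end = 0  # rightmost position already counted
    for start, end in sorted(intervals):
        start = max(start, current_end)   # skip bases already counted
        end = min(end, region_length)     # clip to the region
        if end > start:
            covered += end - start
            current_end = end
    return covered / region_length


# Hypothetical annotations (not data from the paper): two overlapping
# repeats and one separate repeat in a 1,000-bp region.
example = [(0, 100), (50, 150), (300, 400)]
fraction = covered_fraction(example, 1000)
```

Merging overlaps before summing matters because nested or overlapping repeat annotations would otherwise inflate the reported percentage.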