Ohno S
Clin Genet. 1983 Jan;23(1):58-69. doi: 10.1111/j.1399-0004.1983.tb00438.x.
In our previous paper, we identified 32 recurring base oligomers (1 decamer, 4 octamers, 9 heptamers and 18 hexamers) within the coding sequence for the mouse class I major histocompatibility (MHC) antigen H-2Kb. The compilation of these recurring base oligomers led to the conclusion that the entire ancestral coding sequence for class 1 MHC antigens evolved from tandem repeats of the one 45 base-long primordial building block base sequence. As with most other mammalian genes, the gene for each class I MHC antigen is an admixture of coding and noncoding segments interspersed with each other. Thus, the status of noncoding segments should be clarified in relation to the concept of the primordial building block. The published 4,122 base-long sequence of human pHLA 12.4 germline gene afforded me an opportunity for clarification. Since the number of recurring base oligomers residing within the entire 4,000 + base-long sequence proved unmanageably numerous, the two portions 1,200 bases in the total length were singled out. Within these portions containing five noncoding and four coding segments, noncoding and coding segments shared 1 nonomer, 3 octamers, 6 heptamers and 5 hexamers; all the above-noted base oligomers were derived from different parts of the same 45 base-long primordial building block. It was thus concluded that not only the coding segments, but the entire ancestral gene for class I MHC antigens evolved from tandem repeats of the 45-base-long primordial building block. This new concept of primordial building block was discussed in relation to the mechanism of evolution by gene duplication.
在我们之前的论文中,我们在小鼠I类主要组织相容性(MHC)抗原H-2Kb的编码序列中鉴定出32个重复的碱基寡聚物(1个十聚体、4个八聚体、9个七聚体和18个六聚体)。这些重复碱基寡聚物的汇编得出结论,I类MHC抗原的整个祖先编码序列是由一个45个碱基长的原始构建模块碱基序列的串联重复进化而来的。与大多数其他哺乳动物基因一样,每个I类MHC抗原的基因是相互穿插的编码和非编码片段的混合物。因此,应根据原始构建模块的概念来阐明非编码片段的状态。已发表的人类pHLA 12.4种系基因的4122个碱基长的序列为我提供了一个澄清的机会。由于在整个4000多个碱基长的序列中存在的重复碱基寡聚物数量多得难以处理,因此从全长中挑选出了两个1200个碱基的部分。在这些包含五个非编码和四个编码片段的部分中,非编码和编码片段共享1个九聚体(译者注:原文nonomer有误,应为nonamer)、3个八聚体、6个七聚体和5个六聚体;所有上述碱基寡聚物均来自同一个45个碱基长的原始构建模块的不同部分。因此得出结论,不仅编码片段,而且I类MHC抗原的整个祖先基因都是由45个碱基长的原始构建模块的串联重复进化而来的。本文结合基因重复的进化机制,对这一原始构建模块的新概念进行了讨论。