Suppr超能文献

作为原始地球原始编码序列的碱基寡聚物重复序列及其在现代基因中的遗迹。

Repeats of base oligomers as the primordial coding sequences of the primeval earth and their vestiges in modern genes.

作者信息

Ohno S

出版信息

J Mol Evol. 1984;20(3-4):313-21. doi: 10.1007/BF02104737.

Abstract

Three outstanding properties uniquely qualify repeats of base oligomers as the primordial coding sequences of all polypeptide chains. First, when compared with randomly generated base sequences in general, they are more likely to have long open reading frames. Second, periodical polypeptide chains specified by such repeats are more likely to assume either alpha-helical or beta-sheet secondary structures than are polypeptide chains of random sequence. Third, provided that the number of bases in the oligomeric unit is not a multiple of 3, these internally repetitious coding sequences are impervious to randomly sustained base substitutions, deletions, and insertions. This is because the recurring periodicity of their polypeptide chains is given by three consecutive copies of the oligomeric unit translated in three different reading frames. Accordingly, when one reading frame is open, the other two are automatically open as well, all three being capable of coding for polypeptide chains of identical periodicity. Under this circumstance, a frame shift due to the deletion or insertion of a number of bases that is not a multiple of 3 fails to alter the down-stream amino acid sequence, and even a base change causing premature chain-termination can silence only one of the three potential coding units. Newly arisen coding sequences in modern organisms are oligomeric repeats, and most of the older genes retain various vestiges of their original internal repetitions. Some of the genes (e.g., oncogenes) have even inherited the property of being impervious to randomly sustained base changes.

摘要

碱基寡聚物的重复序列具有三项卓越特性,使其成为所有多肽链原始编码序列的独特之选。其一,与一般随机生成的碱基序列相比,它们更有可能拥有长开放阅读框。其二,此类重复序列所指定的周期性多肽链比随机序列的多肽链更有可能呈现α螺旋或β折叠二级结构。其三,倘若寡聚单元中的碱基数不是3的倍数,这些内部重复的编码序列就不会受到随机发生的碱基替换、缺失和插入的影响。这是因为其多肽链的重复周期性是由寡聚单元在三个不同阅读框中连续翻译三次给出的。因此,当一个阅读框开放时,其他两个阅读框也会自动开放,所有三个阅读框都能够编码具有相同周期性的多肽链。在这种情况下,由于缺失或插入非3倍数个碱基导致的移码不会改变下游氨基酸序列,甚至导致链提前终止的碱基变化也只会使三个潜在编码单元中的一个沉默。现代生物体中新出现的编码序列是寡聚重复序列,而且大多数较古老的基因仍保留着其原始内部重复的各种痕迹。一些基因(如癌基因)甚至继承了不受随机发生的碱基变化影响的特性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验