Bornberg-Bauer Erich, Schmitz Jonathan, Heberlein Magdalena
Institute for Evolution and Biodiversity, University of Muenster, Huefferstrasse 1, D48149 Muenster, Germany
Institute for Evolution and Biodiversity, University of Muenster, Huefferstrasse 1, D48149 Muenster, Germany.
Biochem Soc Trans. 2015 Oct;43(5):867-73. doi: 10.1042/BST20150089.
Proteins are the workhorses of the cell and, over billions of years, they have evolved an amazing plethora of extremely diverse and versatile structures with equally diverse functions. Evolutionary emergence of new proteins and transitions between existing ones are believed to be rare or even impossible. However, recent advances in comparative genomics have repeatedly called some 10%-30% of all genes without any detectable similarity to existing proteins. Even after careful scrutiny, some of those orphan genes contain protein coding reading frames with detectable transcription and translation. Thus some proteins seem to have emerged from previously non-coding 'dark genomic matter'. These 'de novo' proteins tend to be disordered, fast evolving, weakly expressed but also rapidly assuming novel and physiologically important functions. Here we review mechanisms by which 'de novo' proteins might be created, under which circumstances they may become fixed and why they are elusive. We propose a 'grow slow and moult' model in which first a reading frame is extended, coding for an initially disordered and non-globular appendage which, over time, becomes more structured and may also become associated with other proteins.
蛋白质是细胞的“主力军”,在数十亿年的时间里,它们进化出了大量惊人的、极其多样且功能各异的结构。新蛋白质的进化出现以及现有蛋白质之间的转变被认为是罕见的,甚至是不可能的。然而,比较基因组学的最新进展一再表明,所有基因中有10% - 30%与现有蛋白质没有任何可检测到的相似性。即使经过仔细审查,其中一些孤儿基因仍含有可检测到转录和翻译的蛋白质编码阅读框。因此,一些蛋白质似乎是从以前的非编码“暗基因组物质”中产生的。这些“从头开始”的蛋白质往往是无序的、快速进化的、低表达的,但也能迅速承担新的生理上重要的功能。在这里,我们综述了“从头开始”的蛋白质可能产生的机制、它们在何种情况下可能被固定以及它们难以捉摸的原因。我们提出了一个“缓慢生长和蜕变”模型,即首先扩展一个阅读框,编码一个最初无序且非球状的附属物,随着时间的推移,这个附属物会变得更有结构,也可能与其他蛋白质结合。