Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA.
C. R. Woese Institute for Genomic Biology, University of Illinois, Urbana, Illinois, USA.
Expert Rev Proteomics. 2021 Oct;18(10):863-880. doi: 10.1080/14789450.2021.1992277. Epub 2021 Nov 22.
While the origin and evolution of proteins remain mysterious, advances in evolutionary genomics and systems biology are facilitating the historical exploration of the structure, function and organization of proteins and proteomes. Molecular chronologies are series of time events describing the history of biological systems and subsystems and the rise of biological innovations. Together with time-varying networks, these chronologies provide a window into the past.
Here, we review molecular chronologies and networks built with modern methods of phylogeny reconstruction. We discuss how chronologies of structural domain families uncover the explosive emergence of metabolism, the late rise of translation, the co-evolution of ribosomal proteins and rRNA, and the late development of the ribosomal exit tunnel; events that coincided with a tendency to shorten folding time. Evolving networks described the early emergence of domains and a late 'big bang' of domain combinations.
Two processes, folding and recruitment appear central to the evolutionary progression. The former increases protein persistence. The later fosters diversity. Chronologically, protein evolution mirrors folding by combining supersecondary structures into domains, developing translation machinery to facilitate folding speed and stability, and enhancing structural complexity by establishing long-distance interactions in novel structural and architectural designs.
尽管蛋白质的起源和演化仍然神秘莫测,但进化基因组学和系统生物学的进步正在促进对蛋白质和蛋白质组的结构、功能和组织的历史探索。分子年代表是一系列时间事件,描述了生物系统和子系统的历史以及生物创新的兴起。这些年代表与时变网络一起,为我们提供了一个了解过去的窗口。
在这里,我们回顾了使用现代系统发育重建方法构建的分子年代表和网络。我们讨论了结构域家族的年代表如何揭示代谢的爆发式出现、翻译的后期兴起、核糖体蛋白和 rRNA 的共同进化,以及核糖体出口隧道的后期发展;这些事件与折叠时间缩短的趋势同时发生。进化网络描述了结构域的早期出现以及后期的“大爆炸”。
折叠和招募这两个过程似乎是进化进程的核心。前者增加了蛋白质的持久性。后者促进了多样性。从时间上看,蛋白质进化通过将超二级结构组合成结构域来模拟折叠,开发翻译机制以促进折叠速度和稳定性,并通过在新的结构和架构设计中建立长程相互作用来提高结构复杂性。