Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, Institute for Genomic Biology and Illinois Informatics Institute, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
School of Science and Technology, Georgia Gwinnett College, Lawrenceville, GA 30043, USA.
Archaea. 2014 Jun 2;2014:590214. doi: 10.1155/2014/590214. eCollection 2014.
The study of the origin of diversified life has been plagued by technical and conceptual difficulties, controversy, and apriorism. It is now popularly accepted that the universal tree of life is rooted in the akaryotes and that Archaea and Eukarya are sister groups to each other. However, evolutionary studies have overwhelmingly focused on nucleic acid and protein sequences, which partially fulfill only two of the three main steps of phylogenetic analysis, formulation of realistic evolutionary models, and optimization of tree reconstruction. In the absence of character polarization, that is, the ability to identify ancestral and derived character states, any statement about the rooting of the tree of life should be considered suspect. Here we show that macromolecular structure and a new phylogenetic framework of analysis that focuses on the parts of biological systems instead of the whole provide both deep and reliable phylogenetic signal and enable us to put forth hypotheses of origin. We review over a decade of phylogenomic studies, which mine information in a genomic census of millions of encoded proteins and RNAs. We show how the use of process models of molecular accumulation that comply with Weston's generality criterion supports a consistent phylogenomic scenario in which the origin of diversified life can be traced back to the early history of Archaea.
对多样化生命起源的研究一直受到技术和概念上的困难、争议和先验主义的困扰。现在人们普遍接受这样一种观点,即普遍的生命之树植根于原核生物,古菌和真核生物是彼此的姐妹群。然而,进化研究主要集中在核酸和蛋白质序列上,这些序列只能部分满足系统发育分析的三个主要步骤,即现实进化模型的制定和树重建的优化。在缺乏特征极化(即识别祖先和衍生特征状态的能力)的情况下,任何关于生命之树起源的说法都应该被怀疑。在这里,我们表明,大分子结构和一个新的分析框架,侧重于生物系统的部分而不是整体,提供了深刻和可靠的系统发育信号,并使我们能够提出起源的假设。我们回顾了十多年的基因组学研究,这些研究挖掘了数以百万计编码蛋白质和 RNA 的基因组普查中的信息。我们展示了如何使用符合 Weston 普遍性准则的分子积累过程模型来支持一个一致的基因组学情景,在这个情景中,多样化生命的起源可以追溯到古菌的早期历史。