Harish Ajith, Kurland Charles G
Department of Cell and Molecular Biology, Structural and Molecular Biology Program, Uppsala University, Uppsala, Sweden.
Department of Biology, Microbial Ecology Program, Lund University, Lund, Sweden.
Biochimie. 2017 Jul;138:168-183. doi: 10.1016/j.biochi.2017.04.013. Epub 2017 Apr 28.
We reconstructed a global tree of life (ToL) with non-reversible and non-stationary models of genome evolution that root trees intrinsically. We implemented Bayesian model selection tests and compared the statistical support for four conflicting ToL hypotheses. We show that reconstructions obtained with a Bayesian implementation (Klopfstein et al., 2015) are consistent with reconstructions obtained with an empirical Sankoff parsimony (ESP) implementation (Harish et al., 2013). Both are based on the genome contents of coding sequences for protein domains (superfamilies) from hundreds of genomes. Thus, we conclude that the independent descent of Eukaryotes and Akaryotes (archaea and bacteria) from the universal common ancestor (UCA) is the most probable as well as the most parsimonious hypothesis for the evolutionary origins of extant genomes. Reconstructions of ancestral proteomes by both Bayesian and ESP methods suggest that at least 70% of unique domain-superfamilies known in extant species were present in the UCA. In addition, identification of a vast majority (96%) of the mitochondrial superfamilies in the UCA proteome precludes a symbiotic hypothesis for the origin of eukaryotes. Accordingly, neither the archaeal origin of eukaryotes nor the bacterial origin of mitochondria is supported by the data. The proteomic complexity of the UCA suggests that the evolution of cellular phenotypes in the two primordial lineages, Akaryotes and Eukaryotes, was driven largely by duplication of common superfamilies as well as by loss of unique superfamilies. Finally, innovation of novel superfamilies has played a surprisingly small role in the evolution of Akaryotes and only a marginal role in the evolution of Eukaryotes.
我们使用基因组进化的不可逆和非平稳模型重建了一个全球生命树(ToL),这些模型能从本质上确定树的根。我们实施了贝叶斯模型选择测试,并比较了对四个相互冲突的ToL假设的统计支持。我们表明,通过贝叶斯方法(Klopfstein等人,2015年)获得的重建结果与通过经验性桑科夫简约法(ESP)(Harish等人,2013年)获得的重建结果一致。两者都基于数百个基因组中蛋白质结构域(超家族)编码序列的基因组内容。因此,我们得出结论,真核生物和原核生物(古菌和细菌)从普遍共同祖先(UCA)独立演化而来,是现存基因组进化起源最有可能且最简约的假设。通过贝叶斯方法和ESP方法对祖先蛋白质组的重建表明,现存物种中已知的至少70%的独特结构域超家族存在于UCA中。此外,在UCA蛋白质组中鉴定出绝大多数(96%)的线粒体超家族,排除了真核生物起源的共生假说。因此,数据既不支持真核生物起源于古菌,也不支持线粒体起源于细菌。UCA的蛋白质组复杂性表明,在原核生物和真核生物这两个原始谱系中,细胞表型的进化很大程度上是由常见超家族的复制以及独特超家族的丢失驱动的。最后,新型超家族的创新在原核生物的进化中所起的作用小得出奇,在真核生物的进化中只起了边缘作用。