Aix-Marseille Université, CNRS, UMR 7283, Laboratoire de Chimie Bactérienne, IMM, 31 chemin Joseph Aiguier, F-13402 Marseille, France.
Université de Lyon, Université Lyon 1, CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, 43 boulevard du 11 novembre 1918, F-69622 Villeurbanne, France.
Mol Phylogenet Evol. 2014 Jun;75:103-17. doi: 10.1016/j.ympev.2014.02.013. Epub 2014 Feb 28.
The seminal work of Carl Woese and co-workers has contributed to promote the RNA component of the small subunit of the ribosome (SSU rRNA) as a "gold standard" of modern prokaryotic taxonomy and systematics, and an essential tool to explore microbial diversity. Yet, this marker has a limited resolving power, especially at deep phylogenetic depth and can lead to strongly biased trees. The ever-larger number of available complete genomes now calls for a novel standard dataset of robust protein markers that may complement SSU rRNA. In this respect, concatenation of ribosomal proteins (r-proteins) is being growingly used to reconstruct large-scale prokaryotic phylogenies, but their suitability for systematic and/or taxonomic purposes has not been specifically addressed. Using Proteobacteria as a case study, we show that amino acid and nucleic acid r-protein sequences contain a reliable phylogenetic signal at a wide range of taxonomic depths, which has not been totally blurred by mutational saturation or horizontal gene transfer. The use of accurate evolutionary models and reconstruction methods allows overcoming most tree reconstruction artefacts resulting from compositional biases and/or fast evolutionary rates. The inferred phylogenies allow clarifying the relationships among most proteobacterial orders and families, along with the position of several unclassified lineages, suggesting some possible revisions of the current classification. In addition, we investigate the root of the Proteobacteria by considering the time-variation of nucleic acid composition of r-protein sequences and the information carried by horizontal gene transfers, two approaches that do not require the use of an outgroup and limit tree reconstruction artefacts. Altogether, our analyses indicate that r-proteins may represent a promising standard for prokaryotic taxonomy and systematics.
卡尔·沃斯(Carl Woese)及其同事的开创性工作促进了核糖体小亚基的 RNA 成分(SSU rRNA)作为现代原核分类学和系统学的“金标准”,并成为探索微生物多样性的重要工具。然而,该标记的分辨率有限,特别是在深系统发育深度上,并且可能导致严重偏向的树。现在,可用的完整基因组数量不断增加,因此需要一种新的稳健蛋白质标记标准数据集来补充 SSU rRNA。在这方面,核糖体蛋白(r-蛋白)的串联越来越多地用于重建大规模的原核系统发育,但尚未专门针对系统学和/或分类学目的来解决其适用性。使用变形菌作为案例研究,我们表明,氨基酸和核酸 r-蛋白序列在广泛的分类深度范围内都具有可靠的系统发育信号,而这些信号并没有完全被突变饱和或水平基因转移所模糊。使用准确的进化模型和重建方法可以克服由于组成偏差和/或快速进化率而导致的大多数树重建伪影。推断出的系统发育关系可以阐明大多数变形菌目和科之间的关系,以及几个未分类谱系的位置,这表明当前的分类可能需要进行一些修订。此外,我们通过考虑 r-蛋白序列核酸组成的时间变化以及水平基因转移所携带的信息来研究变形菌的根,这两种方法都不需要使用外群并且限制了树重建伪影。总之,我们的分析表明 r-蛋白可能代表原核分类学和系统学的有前途的标准。