乳酸杆菌系统基因组学——迈向该属的重新分类

Lactobacillus phylogenomics--towards a reclassification of the genus.

作者信息

Claesson Marcus J, van Sinderen Douwe, O'Toole Paul W

机构信息

Department of Microbiology and Alimentary Pharmabiotic Centre, University College Cork, Ireland.

出版信息

Int J Syst Evol Microbiol. 2008 Dec;58(Pt 12):2945-54. doi: 10.1099/ijs.0.65848-0.

Abstract

The extremely diverse genus Lactobacillus is the largest among the lactic acid bacteria, with over 145 recognized species. In this work, which to our knowledge is the largest comparative phylogenomics study of a single genus to date, 12 genomes of Lactobacillus strains were subjected to an array of whole-genome and single-marker phylogenetic approaches, to investigate the case for extracting subgeneric groups and to determine whether a single congruent phylogeny could be identified. We conclude that GroEL is a more robust single-gene phylogenetic marker for the genus Lactobacillus than the 16S rRNA gene, when no whole-genome information is available. Significant incongruence was found, both within a set of trees based on 141 core proteins and within those phylogenies based on numbers of orthologues, concatenated RNA polymerase subunits and single gene/protein markers. This is possibly due to different evolutionary rates, hidden paralogies or horizontal gene transfer. Such phylogenetic ambiguities are efficiently visualized with cluster-networks. Although the genus contains some highly unstable taxa, four subgeneric groups were distinguished. Qualitative and quantitative gene analysis of these groups resulted in three findings: there is a relatively small number of group-specific proteins, the majority of which are poorly characterized; major groupings are functionally better distinguishable by absent genes rather than gained/retained genes; and, finally, a gene cluster possibly involved in purine metabolism is uniquely present in four lactobacilli associated with meat. In conclusion, because of either significantly different branching patterns or the availability of too few members, three of the four identified groups could not serve as the basis for identifying candidate novel genera within the current genus. We therefore suggest targeted sequencing of key taxonomic species identified here, which are likely to add sufficient depth for a future reclassification, followed by phylogenomic analysis involving the core proteins identified here. This will ideally be combined with phenotypic data using a polyphasic approach.

摘要

极其多样的乳酸杆菌属是乳酸菌中最大的属,有超过145个已确认的物种。在这项据我们所知是迄今为止对单一属进行的最大规模比较系统基因组学研究中,对12株乳酸杆菌菌株的基因组采用了一系列全基因组和单标记系统发育方法,以研究提取亚属组的情况,并确定是否能识别出单一一致的系统发育关系。我们得出结论,当没有全基因组信息时,GroEL基因作为乳酸杆菌属的单基因系统发育标记比16S rRNA基因更可靠。在基于141个核心蛋白构建的一组树中以及基于直系同源物数量、串联RNA聚合酶亚基和单基因/蛋白标记构建的系统发育树中,都发现了显著的不一致性。这可能是由于不同的进化速率、隐藏的旁系同源性或水平基因转移导致的。通过聚类网络可以有效地可视化这种系统发育的模糊性。尽管该属包含一些高度不稳定的分类群,但仍区分出了四个亚属组。对这些组进行的定性和定量基因分析得出了三个发现:组特异性蛋白的数量相对较少,其中大多数特征不明确;主要分组在功能上通过缺失基因比通过获得/保留基因更容易区分;最后,一个可能参与嘌呤代谢的基因簇仅在与肉类相关的四种乳酸杆菌中存在。总之,由于分支模式显著不同或成员数量过少,四个已识别的组中有三个不能作为在当前属内识别候选新属的基础。因此,我们建议对这里确定的关键分类物种进行靶向测序,这可能会为未来的重新分类增加足够的深度,随后进行涉及这里确定的核心蛋白的系统基因组学分析。理想情况下,这将与使用多相方法的表型数据相结合。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索