Alexandraki Voula, Kazou Maria, Blom Jochen, Pot Bruno, Papadimitriou Konstantinos, Tsakalidou Effie
Laboratory of Dairy Research, Department of Food Science and Human Nutrition, Agricultural University of Athens, Athens, Greece.
Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen, Germany.
Front Microbiol. 2019 Dec 20;10:2916. doi: 10.3389/fmicb.2019.02916. eCollection 2019.
is a major starter for the dairy industry with great economic importance. In this study we analyzed 23 fully sequenced genomes of to highlight novel aspects of the evolution, biology and technological properties of this species. Pan/core genome analysis revealed that the species has an important number of conserved genes and that the pan genome is probably going to be closed soon. According to whole genome phylogeny and average nucleotide identity (ANI) analysis, most strains were grouped in two major clusters (i.e., clusters A and B). More specifically, cluster A includes strains with chromosomes above 1.83 Mbp, while cluster B includes chromosomes below this threshold. This observation suggests that strains belonging to the two clusters may be differentiated by gene gain or gene loss events. Furthermore, certain strains of cluster A could be further subdivided in subgroups, i.e., subgroup I (ASCC 1275, DGCC 7710, KLDS SM, MN-BM-A02, and ND07), II (MN-BM-A01 and MN-ZLW-002), III (LMD-9 and SMQ-301), and IV (APC151 and ND03). In cluster B certain strains formed one distinct subgroup, i.e., subgroup I (CNRZ1066, CS8, EPS, and S9). Clusters and subgroups observed for indicate the existence of lineages within the species, an observation which was further supported to a variable degree by the distribution and/or the architecture of several genomic traits. These would include exopolysaccharide (EPS) gene clusters, Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs)-CRISPR associated (Cas) systems, as well as restriction-modification (R-M) systems and genomic islands (GIs). Of note, the histidine biosynthetic cluster was found present in all cluster A strains (plus strain NCTC12958) but was absent from all strains in cluster B. Other loci related to lactose/galactose catabolism and urea metabolism, aminopeptidases, the majority of amino acid and peptide transporters, as well as amino acid biosynthetic pathways were found to be conserved in all strains suggesting their central role for the species. Our study highlights the necessity of sequencing and analyzing more complete genomes to further elucidate important aspects of strain diversity within this starter culture that may be related to its application in the dairy industry.
是乳制品行业的主要起始菌株,具有重要的经济意义。在本研究中,我们分析了23个 的全序列基因组,以突出该物种进化、生物学和技术特性的新方面。泛基因组/核心基因组分析表明,该物种有大量保守基因,且泛基因组可能很快就会封闭。根据全基因组系统发育和平均核苷酸同一性(ANI)分析,大多数 菌株被分为两个主要簇(即簇A和簇B)。更具体地说,簇A包括染色体大于1.83 Mbp的菌株,而簇B包括低于此阈值的染色体。这一观察结果表明,属于这两个簇的菌株可能通过基因获得或基因丢失事件而有所分化。此外,簇A的某些菌株可进一步细分为亚组,即亚组I(ASCC 1275、DGCC 7710、KLDS SM、MN-BM-A02和ND07)、II(MN-BM-A01和MN-ZLW-002)、III(LMD-9和SMQ-301)和IV(APC151和ND03)。在簇B中,某些菌株形成了一个独特的亚组,即亚组I(CNRZ1066、CS8、EPS和S9)。观察到的 簇和亚组表明该物种内存在谱系,这一观察结果在不同程度上得到了几种基因组特征的分布和/或结构的进一步支持。这些特征包括胞外多糖(EPS)基因簇、成簇规律间隔短回文重复序列(CRISPRs)-CRISPR相关(Cas)系统,以及限制修饰(R-M)系统和基因组岛(GIs)。值得注意的是,组氨酸生物合成簇在所有簇A菌株(加上菌株NCTC12958)中均有发现,但在簇B的所有菌株中均未发现。与乳糖/半乳糖分解代谢和尿素代谢、氨肽酶、大多数氨基酸和肽转运蛋白以及氨基酸生物合成途径相关的其他位点在所有菌株中均保守,表明它们对该物种的核心作用。我们的研究强调了对更多 完整基因组进行测序和分析的必要性,以进一步阐明这种起始培养物中菌株多样性的重要方面,这些方面可能与其在乳制品行业中的应用有关。