Department of Medical Microbiology and Immunology, School of Medicine and Public Health, University of Wisconsin, Madison, WI, USA.
Microbiology Doctoral Training Program, University of Wisconsin, Madison, WI, USA.
Microb Genom. 2023 Apr;9(4). doi: 10.1099/mgen.0.000988.
Bacterial secondary metabolites, synthesized by enzymes encoded in biosynthetic gene clusters (BGCs), can underlie microbiome homeostasis and serve as commercialized products, which have historically been mined from a select group of taxa. While evolutionary approaches have proven beneficial for prioritizing BGCs for experimental characterization efforts to uncover new natural products, dedicated bioinformatics tools designed for comparative and evolutionary analysis of BGCs within focal taxa are limited. We thus developed ineage pecific nalysis of BGCs (BGC; https://github.com/Kalan-Lab/lsaBGC) to aid exploration of microdiversity and evolutionary trends across homologous groupings of BGCs, gene cluster families (GCFs), in any bacterial taxa of interest. BGC enables rapid and direct identification of GCFs in genomes, calculates evolutionary statistics and conservation for BGC genes, and builds a framework to allow for base resolution mining of novel variants through metagenomic exploration. Through application of the suite to four genera commonly found in skin microbiomes, we uncover new insights into the evolution and diversity of their BGCs. We show that the BGC of the virulence-associated carotenoid staphyloxanthin in is ubiquitous across the genus . While one GCF encoding the biosynthesis of staphyloxanthin showcases evidence for plasmid-mediated horizontal gene transfer (HGT) between species, another GCF appears to be transmitted vertically amongst a sub-clade of skin-associated . Further, the latter GCF, which is well conserved in , has been lost in most , which is the most common species on human skin and is also regarded as a commensal. We also identify thousands of novel single-nucleotide variants (SNVs) within BGCs from the sp. complex, a narrow, multi-species clade that features the most prevalent in healthy skin microbiomes. Although novel SNVs were approximately 10 times as likely to correspond to synonymous changes when located in the top five percentile of conserved sites, BGC identified SNVs that defied this trend and are predicted to underlie amino acid changes within functionally key enzymatic domains. Ultimately, beyond supporting evolutionary investigations of BGCs, BGC also provides important functionalities to aid efforts for the discovery or directed modification of natural products.
细菌次生代谢产物是由生物合成基因簇 (BGCs) 编码的酶合成的,可以维持微生物组的平衡,并作为商业化产品,这些产物历史上是从一组选定的分类群中挖掘出来的。虽然进化方法已被证明有助于优先考虑 BGC 进行实验表征工作,以发现新的天然产物,但专门用于聚焦分类群内 BGCs 的比较和进化分析的专用生物信息学工具是有限的。因此,我们开发了细菌生物合成基因簇的谱系特异性分析 (BGC; https://github.com/Kalan-Lab/lsaBGC),以帮助探索任何感兴趣的细菌分类群中同源 BGC 分组、基因簇家族 (GCF) 的微多样性和进化趋势。BGC 能够快速直接识别基因组中的 GCF,计算 BGC 基因的进化统计数据和保守性,并构建一个框架,允许通过宏基因组探索进行碱基分辨率挖掘新型变体。通过将该套件应用于通常在皮肤微生物组中发现的四个属,我们揭示了它们 BGC 的进化和多样性的新见解。我们表明,与毒力相关的类胡萝卜素番茄红素的 BGC 在 属中无处不在。虽然一个 GCF 编码番茄红素的生物合成显示了物种间质粒介导水平基因转移 (HGT) 的证据,但另一个 GCF 似乎在皮肤相关 的一个亚群中垂直传播。此外,在后一个 GCF 中,它在 中很好地保守,但在大多数 中丢失了, 是人类皮肤上最常见的 物种,也被认为是共生菌。我们还在 种复合体的 BGC 中鉴定出数千个新的单核苷酸变体 (SNV),该复合体是一个狭窄的多物种分支,其中包含健康皮肤微生物组中最常见的 。尽管新的 SNV 位于保守性最高的前 5%的位置时,大约有 10 倍的可能性对应同义变化,但 BGC 鉴定出的 SNV 违背了这一趋势,预计会导致功能关键酶结构域内的氨基酸变化。最终,除了支持 BGC 的进化研究外,BGC 还提供了重要的功能,以帮助发现或定向修饰天然产物的工作。