Suppr超能文献

抗 SMASH 及其他计算方法在挖掘次级代谢产物生物合成基因簇方面的最新进展。

Recent development of antiSMASH and other computational approaches to mine secondary metabolite biosynthetic gene clusters.

出版信息

Brief Bioinform. 2019 Jul 19;20(4):1103-1113. doi: 10.1093/bib/bbx146.

Abstract

Many drugs are derived from small molecules produced by microorganisms and plants, so-called natural products. Natural products have diverse chemical structures, but the biosynthetic pathways producing those compounds are often organized as biosynthetic gene clusters (BGCs) and follow a highly conserved biosynthetic logic. This allows for the identification of core biosynthetic enzymes using genome mining strategies that are based on the sequence similarity of the involved enzymes/genes. However, mining for a variety of BGCs quickly approaches a complexity level where manual analyses are no longer possible and require the use of automated genome mining pipelines, such as the antiSMASH software. In this review, we discuss the principles underlying the predictions of antiSMASH and other tools and provide practical advice for their application. Furthermore, we discuss important caveats such as rule-based BGC detection, sequence and annotation quality and cluster boundary prediction, which all have to be considered while planning for, performing and analyzing the results of genome mining studies.

摘要

许多药物源自微生物和植物产生的小分子,即所谓的天然产物。天然产物具有多样的化学结构,但产生这些化合物的生物合成途径通常被组织为生物合成基因簇 (BGC),并遵循高度保守的生物合成逻辑。这使得可以使用基于涉及的酶/基因的序列相似性的基因组挖掘策略来鉴定核心生物合成酶。然而,对各种 BGC 的挖掘很快就达到了一个复杂程度,以至于手动分析不再可行,需要使用自动化的基因组挖掘管道,例如 antiSMASH 软件。在这篇综述中,我们讨论了 antiSMASH 和其他工具的预测所依据的原则,并为它们的应用提供了实用建议。此外,我们还讨论了一些重要的注意事项,例如基于规则的 BGC 检测、序列和注释质量以及簇边界预测,在规划、执行和分析基因组挖掘研究的结果时都必须考虑到这些注意事项。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d37c/6781578/6adeaa39390a/bbx146f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验