微生物天然产物数据库：在多组学时代的发展。

Microbial natural product databases: moving forward in the multi-omics era.

机构信息

Department of Chemistry, Simon Fraser University, Burnaby, CA, USA.

出版信息

Nat Prod Rep. 2021 Jan 1;38(1):264-278. doi: 10.1039/d0np00053a. Epub 2020 Aug 28.

DOI:10.1039/d0np00053a

PMID:32856641

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7864863/

Abstract

Covering: 2010-2020The digital revolution is driving significant changes in how people store, distribute, and use information. With the advent of new technologies around linked data, machine learning and large-scale network inference, the natural products research field is beginning to embrace real-time sharing and large-scale analysis of digitized experimental data. Databases play a key role in this, as they allow systematic annotation and storage of data for both basic and advanced applications. The quality of the content, structure, and accessibility of these databases all contribute to their usefulness for the scientific community in practice. This review covers the development of databases relevant for microbial natural product discovery during the past decade (2010-2020), including repositories of chemical structures/properties, metabolomics, and genomic data (biosynthetic gene clusters). It provides an overview of the most important databases and their functionalities, highlights some early meta-analyses using such databases, and discusses basic principles to enable widespread interoperability between databases. Furthermore, it points out conceptual and practical challenges in the curation and usage of natural products databases. Finally, the review closes with a discussion of key action points required for the field moving forward, not only for database developers but for any scientist active in the field.

摘要

涵盖范围

2010-2020 年

数字革命正在推动人们存储、分发和使用信息的方式发生重大变化。随着围绕链接数据、机器学习和大规模网络推理的新技术的出现，天然产物研究领域开始接受数字化实验数据的实时共享和大规模分析。数据库在这方面发挥着关键作用，因为它们允许对数据进行系统注释和存储，适用于基础和高级应用。这些数据库的内容质量、结构和可访问性都有助于提高它们在实践中对科学界的有用性。

本篇综述涵盖了过去十年（2010-2020 年）与微生物天然产物发现相关的数据库的发展情况，包括化学结构/性质、代谢组学和基因组数据（生物合成基因簇）的存储库。它概述了最重要的数据库及其功能，强调了一些早期使用这些数据库的元分析，并讨论了实现数据库之间广泛互操作性的基本原则。此外，它还指出了天然产物数据库的管理和使用方面存在的概念和实际挑战。最后，本文讨论了该领域向前发展所需的关键要点，不仅适用于数据库开发人员，也适用于该领域的任何科学家。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a866/7864863/78e03a76726f/nihms-1625779-f0002.jpg

相似文献

Microbial natural product databases: moving forward in the multi-omics era.微生物天然产物数据库：在多组学时代的发展。

Nat Prod Rep. 2021 Jan 1;38(1):264-278. doi: 10.1039/d0np00053a. Epub 2020 Aug 28.

The year 2020 in natural product bioinformatics: an overview of the latest tools and databases.2020 年天然产物生物信息学：最新工具和数据库概述。

Nat Prod Rep. 2021 Mar 4;38(2):301-306. doi: 10.1039/d0np00090f.

Metabolomics and genomics in natural products research: complementary tools for targeting new chemical entities.代谢组学和基因组学在天然产物研究中的应用：靶向新化学实体的互补工具。

Nat Prod Rep. 2021 Nov 17;38(11):2041-2065. doi: 10.1039/d1np00036e.

Leveraging Microbial Genomes and Genomic Context for Chemical Discovery.利用微生物基因组和基因组背景进行化学发现。

Acc Chem Res. 2021 Jul 6;54(13):2788-2797. doi: 10.1021/acs.accounts.1c00100. Epub 2021 Jun 4.

Meta-omic characterization of prokaryotic gene clusters for natural product biosynthesis.原核生物天然产物生物合成基因簇的宏基因组学特征分析。

Curr Opin Biotechnol. 2013 Dec;24(6):1151-8. doi: 10.1016/j.copbio.2013.05.001. Epub 2013 May 31.

MIBiG 3.0: a community-driven effort to annotate experimentally validated biosynthetic gene clusters.MIBiG 3.0：一个社区驱动的努力，用于注释经过实验验证的生物合成基因簇。

Nucleic Acids Res. 2023 Jan 6;51(D1):D603-D610. doi: 10.1093/nar/gkac1049.

The future of Cochrane Neonatal.考克兰新生儿协作网的未来。

Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

The Natural Products Atlas: An Open Access Knowledge Base for Microbial Natural Products Discovery.《天然产物图谱：微生物天然产物发现的开放获取知识库》

ACS Cent Sci. 2019 Nov 27;5(11):1824-1833. doi: 10.1021/acscentsci.9b00806. Epub 2019 Nov 14.

Heterologous expression of bacterial natural product biosynthetic pathways.细菌天然产物生物合成途径的异源表达。

Nat Prod Rep. 2019 Oct 16;36(10):1412-1436. doi: 10.1039/c8np00091c.

New voyages to explore the natural product galaxy.探索天然产物星系的新航程。

J Ind Microbiol Biotechnol. 2019 Mar;46(3-4):273-279. doi: 10.1007/s10295-018-02122-w. Epub 2019 Jan 4.

引用本文的文献

Green genes from blue greens: challenges and solutions to unlocking the potential of cyanobacteria in drug discovery.来自蓝细菌的绿色基因：挖掘蓝藻细菌在药物研发中潜力的挑战与解决方案

Nat Prod Rep. 2025 Jul 15. doi: 10.1039/d5np00016e.

Cancer chemoprevention: signaling pathways and strategic approaches.癌症化学预防：信号通路与策略方法

Signal Transduct Target Ther. 2025 Apr 18;10(1):113. doi: 10.1038/s41392-025-02167-1.

The workshops on computational applications in secondary metabolite discovery (CAiSMD).次生代谢产物发现中的计算应用研讨会（CAiSMD）

Phys Sci Rev. 2024 May 8;9(10):3289-3304. doi: 10.1515/psr-2024-0015. eCollection 2024 Oct.

Overview and limitations of database in global traditional medicines: A narrative review.全球传统医学数据库概述与局限性：一篇叙述性综述

Acta Pharmacol Sin. 2025 Feb;46(2):235-263. doi: 10.1038/s41401-024-01353-1. Epub 2024 Aug 2.

Marine Pharmacology in 2019-2021: Marine Compounds with Antibacterial, Antidiabetic, Antifungal, Anti-Inflammatory, Antiprotozoal, Antituberculosis and Antiviral Activities; Affecting the Immune and Nervous Systems, and Other Miscellaneous Mechanisms of Action.2019-2021 年海洋药理学：具有抗菌、抗糖尿病、抗真菌、抗炎、抗原生动物、抗结核和抗病毒活性的海洋化合物；影响免疫系统和神经系统以及其他各种作用机制。

Mar Drugs. 2024 Jun 30;22(7):309. doi: 10.3390/md22070309.

Discovery of Streptomyces species CS-62, a novel producer of the Acinetobacter baumannii selective antibiotic factumycin.发现链霉菌属CS-62，一种新型鲍曼不动杆菌选择性抗生素法克霉素的产生菌。

J Ind Microbiol Biotechnol. 2024 Jan 9;51. doi: 10.1093/jimb/kuae014.

PhyloSophos: a high-throughput scientific name mapping algorithm augmented with explicit consideration of taxonomic science, and its application on natural product (NP) occurrence database processing.PhyloSophos：一种高通量科学名称映射算法，其增强了对分类学科学的明确考虑，及其在天然产物 (NP) 出现数据库处理中的应用。

BMC Bioinformatics. 2023 Dec 14;24(1):475. doi: 10.1186/s12859-023-05588-3.

Natural product biosynthetic potential reflects macroevolutionary diversification within a widely distributed bacterial taxon.天然产物生物合成潜力反映了广泛分布的细菌分类群内的宏观进化多样化。

mSystems. 2023 Dec 21;8(6):e0064323. doi: 10.1128/msystems.00643-23. Epub 2023 Nov 29.

EMNPD: a comprehensive endophytic microorganism natural products database for prompt the discovery of new bioactive substances.EMNPD：一个用于促进新型生物活性物质发现的内生微生物天然产物综合数据库。

J Cheminform. 2023 Nov 28;15(1):115. doi: 10.1186/s13321-023-00779-9.

A Data Deposition Platform for Sharing Nuclear Magnetic Resonance Data.用于共享磁共振数据的数据存储库平台。

J Nat Prod. 2023 Nov 24;86(11):2554-2561. doi: 10.1021/acs.jnatprod.3c00795. Epub 2023 Nov 7.

本文引用的文献

Review on natural products databases: where to find data in 2020.天然产物数据库综述：2020年何处获取数据

J Cheminform. 2020 Apr 3;12(1):20. doi: 10.1186/s13321-020-00424-9.

A unified catalog of 204,938 reference genomes from the human gut microbiome.人类肠道微生物组 204938 个参考基因组的统一目录。

Nat Biotechnol. 2021 Jan;39(1):105-114. doi: 10.1038/s41587-020-0603-3. Epub 2020 Jul 20.

Toward FAIRness and a User-Friendly Repository for Supporting NMR Data.迈向支持核磁共振数据的公平性与用户友好型知识库。

Org Lett. 2020 Apr 17;22(8):2867. doi: 10.1021/acs.orglett.0c01143. Epub 2020 Apr 3.

Disclosing the Potential of the SARP-Type Regulator PapR2 for the Activation of Antibiotic Gene Clusters in Streptomycetes.揭示SARP型调控因子PapR2激活链霉菌中抗生素基因簇的潜力。

Front Microbiol. 2020 Feb 18;11:225. doi: 10.3389/fmicb.2020.00225. eCollection 2020.

A Convolutional Neural Network-Based Approach for the Rapid Annotation of Molecularly Diverse Natural Products.基于卷积神经网络的方法用于快速注释分子多样的天然产物。

J Am Chem Soc. 2020 Mar 4;142(9):4114-4120. doi: 10.1021/jacs.9b13786. Epub 2020 Feb 21.

The Natural Products Atlas: An Open Access Knowledge Base for Microbial Natural Products Discovery.《天然产物图谱：微生物天然产物发现的开放获取知识库》

ACS Cent Sci. 2019 Nov 27;5(11):1824-1833. doi: 10.1021/acscentsci.9b00806. Epub 2019 Nov 14.

A computational framework to explore large-scale biosynthetic diversity.用于探索大规模生物合成多样性的计算框架。

Nat Chem Biol. 2020 Jan;16(1):60-68. doi: 10.1038/s41589-019-0400-9. Epub 2019 Nov 25.

MetaboLights: a resource evolving in response to the needs of its scientific community.代谢组学文献共享资源库（MetaboLights）：一个响应其科研群体需求而不断发展的资源库。

Nucleic Acids Res. 2020 Jan 8;48(D1):D440-D444. doi: 10.1093/nar/gkz1019.

MIBiG 2.0: a repository for biosynthetic gene clusters of known function.MIBiG 2.0：已知功能的生物合成基因簇的存储库。

Nucleic Acids Res. 2020 Jan 8;48(D1):D454-D458. doi: 10.1093/nar/gkz882.

Compendium of 4,941 rumen metagenome-assembled genomes for rumen microbiome biology and enzyme discovery.4941 个瘤胃宏基因组组装基因组概述，用于瘤胃微生物组生物学和酶发现。

Nat Biotechnol. 2019 Aug;37(8):953-961. doi: 10.1038/s41587-019-0202-3. Epub 2019 Aug 2.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验