IMG-ABC：一个促进生物合成基因簇和新型次生代谢产物发现的知识库。

IMG-ABC: A Knowledge Base To Fuel Discovery of Biosynthetic Gene Clusters and Novel Secondary Metabolites.

作者信息

Hadjithomas Michalis, Chen I-Min Amy, Chu Ken, Ratner Anna, Palaniappan Krishna, Szeto Ernest, Huang Jinghua, Reddy T B K, Cimermančič Peter, Fischbach Michael A, Ivanova Natalia N, Markowitz Victor M, Kyrpides Nikos C, Pati Amrita

机构信息

Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, California, USA.

Biosciences Computing, Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA.

出版信息

mBio. 2015 Jul 14;6(4):e00932. doi: 10.1128/mBio.00932-15.

DOI:10.1128/mBio.00932-15

PMID:26173699

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4502231/

Abstract

UNLABELLED

In the discovery of secondary metabolites, analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of computational platforms that enable such a systematic approach on a large scale. In this work, we present IMG-ABC (https://img.jgi.doe.gov/abc), an atlas of biosynthetic gene clusters within the Integrated Microbial Genomes (IMG) system, which is aimed at harnessing the power of "big" genomic data for discovering small molecules. IMG-ABC relies on IMG's comprehensive integrated structural and functional genomic data for the analysis of biosynthetic gene clusters (BCs) and associated secondary metabolites (SMs). SMs and BCs serve as the two main classes of objects in IMG-ABC, each with a rich collection of attributes. A unique feature of IMG-ABC is the incorporation of both experimentally validated and computationally predicted BCs in genomes as well as metagenomes, thus identifying BCs in uncultured populations and rare taxa. We demonstrate the strength of IMG-ABC's focused integrated analysis tools in enabling the exploration of microbial secondary metabolism on a global scale, through the discovery of phenazine-producing clusters for the first time in Alphaproteobacteria. IMG-ABC strives to fill the long-existent void of resources for computational exploration of the secondary metabolism universe; its underlying scalable framework enables traversal of uncovered phylogenetic and chemical structure space, serving as a doorway to a new era in the discovery of novel molecules.

IMPORTANCE

IMG-ABC is the largest publicly available database of predicted and experimental biosynthetic gene clusters and the secondary metabolites they produce. The system also includes powerful search and analysis tools that are integrated with IMG's extensive genomic/metagenomic data and analysis tool kits. As new research on biosynthetic gene clusters and secondary metabolites is published and more genomes are sequenced, IMG-ABC will continue to expand, with the goal of becoming an essential component of any bioinformatic exploration of the secondary metabolism world.

摘要

未标记

在次生代谢产物的发现过程中，序列数据分析是一条很有前景的探索途径，但由于缺乏能大规模实现这种系统方法的计算平台，该途径在很大程度上仍未得到充分利用。在这项工作中，我们展示了IMG-ABC（https://img.jgi.doe.gov/abc），这是集成微生物基因组（IMG）系统内的一个生物合成基因簇图谱，旨在利用“大”基因组数据的力量来发现小分子。IMG-ABC依靠IMG全面的综合结构和功能基因组数据来分析生物合成基因簇（BCs）及相关次生代谢产物（SMs）。SMs和BCs是IMG-ABC中的两类主要对象，每类都有丰富的属性集合。IMG-ABC的一个独特特征是将基因组以及宏基因组中经过实验验证和计算预测的BCs都纳入其中，从而识别未培养群体和稀有分类单元中的BCs。我们通过首次在α-变形菌纲中发现产吩嗪簇，展示了IMG-ABC专注的综合分析工具在全球范围内探索微生物次生代谢方面的优势。IMG-ABC努力填补长期存在的次生代谢领域计算探索资源空白；其底层的可扩展框架能够遍历未被探索的系统发育和化学结构空间，成为发现新分子新时代的一扇大门。

重要性

IMG-ABC是最大的公开可用的预测和实验生物合成基因簇及其产生的次生代谢产物的数据库。该系统还包括与IMG广泛的基因组/宏基因组数据及分析工具包集成的强大搜索和分析工具。随着关于生物合成基因簇和次生代谢产物的新研究不断发表以及更多基因组被测序，IMG-ABC将持续扩展，目标是成为次生代谢领域任何生物信息学探索的重要组成部分。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d25/4502231/7e8672f4ef57/mbo0041523940001.jpg

相似文献

IMG-ABC: A Knowledge Base To Fuel Discovery of Biosynthetic Gene Clusters and Novel Secondary Metabolites.

mBio. 2015 Jul 14;6(4):e00932. doi: 10.1128/mBio.00932-15.

IMG-ABC v.5.0: an update to the IMG/Atlas of Biosynthetic Gene Clusters Knowledgebase.

Nucleic Acids Res. 2020 Jan 8;48(D1):D422-D430. doi: 10.1093/nar/gkz932.

IMG-ABC: new features for bacterial secondary metabolism analysis and targeted biosynthetic gene cluster discovery in thousands of microbial genomes.

Nucleic Acids Res. 2017 Jan 4;45(D1):D560-D565. doi: 10.1093/nar/gkw1103. Epub 2016 Nov 29.

IMG 4 version of the integrated microbial genomes comparative analysis system.

Nucleic Acids Res. 2014 Jan;42(Database issue):D560-7. doi: 10.1093/nar/gkt963. Epub 2013 Oct 27.

Gene context analysis in the Integrated Microbial Genomes (IMG) data management system.

PLoS One. 2009 Nov 24;4(11):e7979. doi: 10.1371/journal.pone.0007979.

IMG ER: a system for microbial genome annotation expert review and curation.

Bioinformatics. 2009 Sep 1;25(17):2271-8. doi: 10.1093/bioinformatics/btp393. Epub 2009 Jun 27.

The integrated microbial genomes system: an expanding comparative analysis resource.

Nucleic Acids Res. 2010 Jan;38(Database issue):D382-90. doi: 10.1093/nar/gkp887. Epub 2009 Oct 28.

IMG/M v.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes.

Nucleic Acids Res. 2019 Jan 8;47(D1):D666-D677. doi: 10.1093/nar/gky901.

BiG-SLiCE: A highly scalable tool maps the diversity of 1.2 million biosynthetic gene clusters.

Gigascience. 2021 Jan 13;10(1). doi: 10.1093/gigascience/giaa154.

IMG/M: integrated genome and metagenome comparative data analysis system.

Nucleic Acids Res. 2017 Jan 4;45(D1):D507-D516. doi: 10.1093/nar/gkw929. Epub 2016 Oct 13.

引用本文的文献

South African Myxococcota: an untapped resource for microbial ecolo gy and biotechnology.

Appl Microbiol Biotechnol. 2025 Aug 30;109(1):192. doi: 10.1007/s00253-025-13586-z.

Effects of photosynthetic properties, soil environment and bioactive constituents of Eucommia ulmoides Oliver under two planting models.

BMC Plant Biol. 2025 Jul 2;25(1):826. doi: 10.1186/s12870-025-06779-1.

Multi-omic analysis tools for microbial metabolites prediction.

Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae264.

Bacteriocin diversity, function, discovery and application as antimicrobials.

Nat Rev Microbiol. 2024 Sep;22(9):556-571. doi: 10.1038/s41579-024-01045-x. Epub 2024 May 10.

Insights into soil nematode diversity and bacterial community of Thai jasmine rice rhizosphere from different paddy fields in Thailand.

PeerJ. 2024 Apr 23;12:e17289. doi: 10.7717/peerj.17289. eCollection 2024.

Screening of natural phenazine producers for electroactivity in bioelectrochemical systems.

Microb Biotechnol. 2023 Mar;16(3):579-594. doi: 10.1111/1751-7915.14199. Epub 2022 Dec 26.

Functional characterization of prokaryotic dark matter: the road so far and what lies ahead.

Curr Res Microb Sci. 2022 Aug 7;3:100159. doi: 10.1016/j.crmicr.2022.100159. eCollection 2022.

Natural-Product-Based Solutions for Tropical Infectious Diseases.

Clin Microbiol Rev. 2021 Dec 15;34(4):e0034820. doi: 10.1128/CMR.00348-20. Epub 2021 Sep 8.

Genome mining for drug discovery: progress at the front end.

J Ind Microbiol Biotechnol. 2021 Dec 23;48(9-10). doi: 10.1093/jimb/kuab044.

host for refactoring of diverse bioactive secondary metabolites.

3 Biotech. 2021 Jul;11(7):340. doi: 10.1007/s13205-021-02872-y. Epub 2021 Jun 16.

本文引用的文献

The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification.

Nucleic Acids Res. 2015 Jan;43(Database issue):D1099-106. doi: 10.1093/nar/gku950. Epub 2014 Oct 27.

Biosynthesis: bioinformatics bolster a renaissance.

Nat Chem Biol. 2014 Oct;10(10):798-800. doi: 10.1038/nchembio.1634.

Insights into secondary metabolism from a global analysis of prokaryotic biosynthetic gene clusters.

Cell. 2014 Jul 17;158(2):412-421. doi: 10.1016/j.cell.2014.06.034.

Pfam: the protein families database.

Nucleic Acids Res. 2014 Jan;42(Database issue):D222-30. doi: 10.1093/nar/gkt1223. Epub 2013 Nov 27.

GenBank.

Nucleic Acids Res. 2014 Jan;42(Database issue):D32-7. doi: 10.1093/nar/gkt1030. Epub 2013 Nov 11.

IMG 4 version of the integrated microbial genomes comparative analysis system.

Nucleic Acids Res. 2014 Jan;42(Database issue):D560-7. doi: 10.1093/nar/gkt963. Epub 2013 Oct 27.

IMG/M 4 version of the integrated metagenome comparative analysis system.

Nucleic Acids Res. 2014 Jan;42(Database issue):D568-73. doi: 10.1093/nar/gkt919. Epub 2013 Oct 16.

fmcsR: mismatch tolerant maximum common substructure searching in R.

Bioinformatics. 2013 Nov 1;29(21):2792-4. doi: 10.1093/bioinformatics/btt475. Epub 2013 Aug 20.

Insights into the phylogeny and coding potential of microbial dark matter.

Nature. 2013 Jul 25;499(7459):431-7. doi: 10.1038/nature12352. Epub 2013 Jul 14.

antiSMASH 2.0--a versatile platform for genome mining of secondary metabolite producers.

Nucleic Acids Res. 2013 Jul;41(Web Server issue):W204-12. doi: 10.1093/nar/gkt449. Epub 2013 Jun 3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

IMG-ABC：一个促进生物合成基因簇和新型次生代谢产物发现的知识库。

IMG-ABC: A Knowledge Base To Fuel Discovery of Biosynthetic Gene Clusters and Novel Secondary Metabolites.

作者信息

机构信息

Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, California, USA.

Biosciences Computing, Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA.

出版信息

mBio. 2015 Jul 14;6(4):e00932. doi: 10.1128/mBio.00932-15.

DOI:10.1128/mBio.00932-15

PMID:26173699

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4502231/

Abstract

UNLABELLED

IMPORTANCE

摘要

IMG-ABC：一个促进生物合成基因簇和新型次生代谢产物发现的知识库。

IMG-ABC: A Knowledge Base To Fuel Discovery of Biosynthetic Gene Clusters and Novel Secondary Metabolites.

作者信息

机构信息

出版信息

UNLABELLED

IMPORTANCE

未标记

重要性

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

IMG-ABC：一个促进生物合成基因簇和新型次生代谢产物发现的知识库。

IMG-ABC: A Knowledge Base To Fuel Discovery of Biosynthetic Gene Clusters and Novel Secondary Metabolites.

作者信息

机构信息

出版信息

UNLABELLED

IMPORTANCE

未标记

重要性

相似文献

引用本文的文献

本文引用的文献