Hadjithomas Michalis, Chen I-Min A, Chu Ken, Huang Jinghua, Ratner Anna, Palaniappan Krishna, Andersen Evan, Markowitz Victor, Kyrpides Nikos C, Ivanova Natalia N
Microbial Genome and Metagenome Program, Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA
Biosciences Computing, Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
Nucleic Acids Res. 2017 Jan 4;45(D1):D560-D565. doi: 10.1093/nar/gkw1103. Epub 2016 Nov 29.
Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). To advance the exploration of microbial secondary metabolism, we developed the largest publicly available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic gene clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualizations, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery.
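The idea of querying for clusters with similar Pfam composition can be illustrated with a minimal sketch. The abstract does not state which similarity measure IMG-ABC uses, so the Jaccard index below is an assumption chosen for illustration, and the function names `jaccard` and `rank_similar`, the `min_score` threshold, the cluster identifiers and the Pfam accessions in the usage note are all hypothetical placeholders rather than IMG-ABC data or API.

```python
from typing import Dict, FrozenSet, List, Tuple

PfamSet = FrozenSet[str]


def jaccard(a: PfamSet, b: PfamSet) -> float:
    """Jaccard index of two Pfam accession sets (assumed metric, not IMG-ABC's)."""
    union = a | b
    if not union:
        # Two empty sets share no evidence of similarity; treat as 0.
        return 0.0
    return len(a & b) / len(union)


def rank_similar(
    query: PfamSet,
    clusters: Dict[str, PfamSet],
    min_score: float = 0.5,
) -> List[Tuple[str, float]]:
    """Return (cluster_id, score) pairs scoring at least min_score, best first."""
    hits = [(cid, jaccard(query, pfams)) for cid, pfams in clusters.items()]
    kept = [(cid, score) for cid, score in hits if score >= min_score]
    # Sort by descending score, breaking ties by cluster id for stable output.
    return sorted(kept, key=lambda hit: (-hit[1], hit[0]))
```

A call such as `rank_similar(frozenset({"PF00109", "PF02801"}), {"bc_example": frozenset({"PF00109"})})` would score the hypothetical cluster `bc_example` by the fraction of Pfam families it shares with the query. A set-based measure like this ignores gene order and domain copy number, which is one plausible reading of "Pfam composition" but not one the abstract confirms.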