• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于机器学习的生物信息学方法,可从生物合成基因簇中预测生物活性。

A Machine Learning Bioinformatics Method to Predict Biological Activity from Biosynthetic Gene Clusters.

机构信息

Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, 240 Longwood Avenue, Boston, Massachusetts 02115, United States.

出版信息

J Chem Inf Model. 2021 Jun 28;61(6):2560-2571. doi: 10.1021/acs.jcim.0c01304. Epub 2021 May 27.

DOI:10.1021/acs.jcim.0c01304
PMID:34042443
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8243324/
Abstract

Research in natural products, the genetically encoded small molecules produced by organisms in an idiosyncratic fashion, deals with molecular structure, biosynthesis, and biological activity. Bioinformatics analyses of microbial genomes can successfully reveal the genetic instructions, biosynthetic gene clusters, that produce many natural products. Genes to molecule predictions made on biosynthetic gene clusters have revealed many important new structures. There is no comparable method for genes to biological activity predictions. To address this missing pathway, we developed a machine learning bioinformatics method for predicting a natural product's antibiotic activity directly from the sequence of its biosynthetic gene cluster. We trained commonly used machine learning classifiers to predict antibacterial or antifungal activity based on features of known natural product biosynthetic gene clusters. We have identified classifiers that can attain accuracies as high as 80% and that have enabled the identification of biosynthetic enzymes and their corresponding molecular features that are associated with antibiotic activity.

摘要

天然产物研究,即生物体以特殊方式产生的遗传编码小分子,涉及分子结构、生物合成和生物活性。对微生物基因组的生物信息学分析可以成功揭示产生许多天然产物的遗传指令和生物合成基因簇。基于生物合成基因簇进行的基因到分子预测已经揭示了许多重要的新结构。目前还没有可比的方法可以从基因预测到生物活性。为了解决这个缺失的途径,我们开发了一种机器学习生物信息学方法,可直接从其生物合成基因簇的序列预测天然产物的抗生素活性。我们训练了常用的机器学习分类器,根据已知天然产物生物合成基因簇的特征来预测抗菌或抗真菌活性。我们已经确定了可以达到高达 80%准确率的分类器,并且能够鉴定出与抗生素活性相关的生物合成酶及其相应的分子特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/0f9ab3723c1b/ci0c01304_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/54e491ba6c7b/ci0c01304_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/98923f16a899/ci0c01304_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/90f27960264f/ci0c01304_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/315cb27b86a7/ci0c01304_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/b8fa2b042338/ci0c01304_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/0f9ab3723c1b/ci0c01304_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/54e491ba6c7b/ci0c01304_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/98923f16a899/ci0c01304_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/90f27960264f/ci0c01304_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/315cb27b86a7/ci0c01304_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/b8fa2b042338/ci0c01304_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8f8/8243324/0f9ab3723c1b/ci0c01304_0007.jpg

相似文献

1
A Machine Learning Bioinformatics Method to Predict Biological Activity from Biosynthetic Gene Clusters.一种基于机器学习的生物信息学方法,可从生物合成基因簇中预测生物活性。
J Chem Inf Model. 2021 Jun 28;61(6):2560-2571. doi: 10.1021/acs.jcim.0c01304. Epub 2021 May 27.
2
Predicting fungal secondary metabolite activity from biosynthetic gene cluster data using machine learning.基于生物合成基因簇数据利用机器学习预测真菌次生代谢物活性。
Microbiol Spectr. 2024 Feb 6;12(2):e0340023. doi: 10.1128/spectrum.03400-23. Epub 2024 Jan 9.
3
Diversity and taxonomic distribution of bacterial biosynthetic gene clusters predicted to produce compounds with therapeutically relevant bioactivities.预测可产生具有治疗相关生物活性化合物的细菌生物合成基因簇的多样性和分类分布。
J Ind Microbiol Biotechnol. 2023 Feb 17;50(1). doi: 10.1093/jimb/kuad024.
4
Expanding our understanding of sequence-function relationships of type II polyketide biosynthetic gene clusters: bioinformatics-guided identification of Frankiamicin A from Frankia sp. EAN1pec.拓展我们对II型聚酮生物合成基因簇序列-功能关系的理解:基于生物信息学从Frankia sp. EAN1pec中鉴定出弗兰基霉素A。
PLoS One. 2015 Apr 2;10(4):e0121505. doi: 10.1371/journal.pone.0121505. eCollection 2015.
5
A Multi-Label Learning Framework for Predicting Chemical Classes and Biological Activities of Natural Products from Biosynthetic Gene Clusters.一种用于从生物合成基因簇预测天然产物化学类别和生物活性的多标签学习框架。
J Chem Ecol. 2023 Dec;49(11-12):681-695. doi: 10.1007/s10886-023-01452-z. Epub 2023 Oct 2.
6
Antimicrobials Inspired by Nonribosomal Peptide Synthetase Gene Clusters.非核糖体肽合成酶基因簇启示下的抗菌药物
J Am Chem Soc. 2017 Feb 1;139(4):1404-1407. doi: 10.1021/jacs.6b11861. Epub 2017 Jan 23.
7
The genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles.含有噻唑和恶唑杂环的核糖体肽的基因组格局。
BMC Genomics. 2015 Oct 13;16:778. doi: 10.1186/s12864-015-2008-0.
8
Synthetic-Bioinformatic Natural Product Antibiotics with Diverse Modes of Action.具有多种作用模式的合成生物信息天然产物抗生素。
J Am Chem Soc. 2020 Aug 19;142(33):14158-14168. doi: 10.1021/jacs.0c04376. Epub 2020 Aug 11.
9
Analysis of Co-localized Biosynthetic Gene Clusters Identifies a Membrane-Permeabilizing Natural Product.共定位生物合成基因簇分析鉴定出一种具有膜透性的天然产物。
J Nat Prod. 2024 Jul 26;87(7):1694-1703. doi: 10.1021/acs.jnatprod.3c01231. Epub 2024 Jul 1.
10
Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lanthipeptides.通过整合泛基因组学和机器学习来扩展 RiPP 生物合成空间,揭示了一类新型的硫肽。
PLoS Biol. 2020 Dec 22;18(12):e3001026. doi: 10.1371/journal.pbio.3001026. eCollection 2020 Dec.

引用本文的文献

1
Mobilome-mediated transcriptional activation of biosynthetic gene clusters and its impact on strain competitiveness in food fermentation microbiomes.移动基因组介导的生物合成基因簇转录激活及其对食品发酵微生物群落中菌株竞争力的影响。
Microbiome. 2025 Aug 28;13(1):191. doi: 10.1186/s40168-025-02180-0.
2
Green genes from blue greens: challenges and solutions to unlocking the potential of cyanobacteria in drug discovery.来自蓝细菌的绿色基因:挖掘蓝藻细菌在药物研发中潜力的挑战与解决方案
Nat Prod Rep. 2025 Jul 15. doi: 10.1039/d5np00016e.
3
Advances in the discovery and study of natural products for biological control applications.

本文引用的文献

1
Comprehensive prediction of secondary metabolite structure and biological activity from microbial genome sequences.从微生物基因组序列中综合预测次生代谢物结构和生物活性。
Nat Commun. 2020 Nov 27;11(1):6058. doi: 10.1038/s41467-020-19986-1.
2
The antiSMASH database version 3: increased taxonomic coverage and new query features for modular enzymes.反 SMASH 数据库版本 3:增加了分类学覆盖范围和新的模块酶查询功能。
Nucleic Acids Res. 2021 Jan 8;49(D1):D639-D643. doi: 10.1093/nar/gkaa978.
3
A Deep Learning Approach to Antibiotic Discovery.
用于生物防治应用的天然产物的发现与研究进展。
Nat Prod Rep. 2025 Jun 6. doi: 10.1039/d5np00017c.
4
Multidimensional insights into the biodiversity of in soils of China: a pilot study.对中国土壤生物多样性的多维度洞察:一项初步研究。
Microbiol Spectr. 2025 Apr 2;13(5):e0169224. doi: 10.1128/spectrum.01692-24.
5
AI-assisted isolation of bioactive Dipyrimicins from and identification of its corresponding biosynthetic gene cluster.通过人工智能辅助从……中分离生物活性双嘧啶菌素并鉴定其相应的生物合成基因簇。
bioRxiv. 2025 Mar 21:2025.03.21.644653. doi: 10.1101/2025.03.21.644653.
6
Amphibian skin bacteria contain a wide repertoire of genes linked to their antifungal capacities.两栖动物皮肤细菌含有大量与它们的抗真菌能力相关的基因。
World J Microbiol Biotechnol. 2025 Feb 27;41(3):78. doi: 10.1007/s11274-025-04292-z.
7
Artificial Intelligence in Natural Product Drug Discovery: Current Applications and Future Perspectives.天然产物药物发现中的人工智能:当前应用与未来展望。
J Med Chem. 2025 Feb 27;68(4):3948-3969. doi: 10.1021/acs.jmedchem.4c01257. Epub 2025 Feb 6.
8
Interpretable adenylation domain specificity prediction using protein language models.使用蛋白质语言模型进行可解释的腺苷化结构域特异性预测。
bioRxiv. 2025 Jan 18:2025.01.13.632878. doi: 10.1101/2025.01.13.632878.
9
Exploring the Promoter Generation and Prediction of spp. Based on GAN and Multi-Model Fusion Methods.基于生成对抗网络和多模型融合方法探索spp.的启动子生成与预测
Int J Mol Sci. 2024 Dec 6;25(23):13137. doi: 10.3390/ijms252313137.
10
Primed for Discovery.蓄势待发,探索无限。
Biochemistry. 2024 Nov 5;63(21):2705-2713. doi: 10.1021/acs.biochem.4c00464. Epub 2024 Oct 15.
一种用于抗生素发现的深度学习方法。
Cell. 2020 Apr 16;181(2):475-483. doi: 10.1016/j.cell.2020.04.001.
4
Biochemical and Structural Analysis of a Dehydrogenase, KanD2, and an Aminotransferase, KanS2, That Are Responsible for the Construction of the Kanosamine Moiety in Kanamycin Biosynthesis.在卡那霉素生物合成中负责构建卡那糖胺部分的脱氢酶 KanD2 和转氨酶 KanS2 的生化和结构分析。
Biochemistry. 2020 Apr 21;59(15):1470-1473. doi: 10.1021/acs.biochem.0c00204. Epub 2020 Apr 6.
5
Natural Products as Sources of New Drugs over the Nearly Four Decades from 01/1981 to 09/2019.天然产物:1981 年 1 月至 2019 年 9 月近四十年来的新药来源
J Nat Prod. 2020 Mar 27;83(3):770-803. doi: 10.1021/acs.jnatprod.9b01285. Epub 2020 Mar 12.
6
CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database.CARD 2020:利用综合抗生素耐药数据库进行抗生素耐药组监测。
Nucleic Acids Res. 2020 Jan 8;48(D1):D517-D525. doi: 10.1093/nar/gkz935.
7
MIBiG 2.0: a repository for biosynthetic gene clusters of known function.MIBiG 2.0:已知功能的生物合成基因簇的存储库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D454-D458. doi: 10.1093/nar/gkz882.
8
NeuRiPP: Neural network identification of RiPP precursor peptides.NeuRiPP:神经网络鉴定 RiPP 前体肽。
Sci Rep. 2019 Sep 16;9(1):13406. doi: 10.1038/s41598-019-49764-z.
9
A deep learning genome-mining strategy for biosynthetic gene cluster prediction.深度学习基因组挖掘策略用于生物合成基因簇预测。
Nucleic Acids Res. 2019 Oct 10;47(18):e110. doi: 10.1093/nar/gkz654.
10
antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline.antiSMASH 5.0:二次代谢产物基因组挖掘管道的更新。
Nucleic Acids Res. 2019 Jul 2;47(W1):W81-W87. doi: 10.1093/nar/gkz310.