Classification of the adenylation and acyl-transferase activity of NRPS and PKS systems using ensembles of substrate specific hidden Markov models.

Affiliation

Center for Molecular and Biomolecular Informatics, Nijmegen Center for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands.

Publication information

PLoS One. 2013 Apr 18;8(4):e62136. doi: 10.1371/journal.pone.0062136. Print 2013.

DOI:10.1371/journal.pone.0062136

PMID:23637983

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3630128/

Abstract

There is a growing interest in the Non-ribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs) of microbes, fungi and plants because they can produce bioactive peptides such as antibiotics. The ability to identify the substrate specificity of the enzyme's adenylation (A) and acyl-transferase (AT) domains is essential to rationally deduce or engineer new products. We here report on a Hidden Markov Model (HMM)-based ensemble method to predict the substrate specificity at high quality. We collected a new reference set of experimentally validated sequences. An initial classification based on alignment and Neighbor Joining was performed in line with most of the previously published prediction methods. We then created and tested single substrate specific HMMs and found that their use improved the correct identification significantly for A as well as for AT domains. A major advantage of the use of HMMs is that it abolishes the dependency on multiple sequence alignment and residue selection that is hampering the alignment-based clustering methods. Using our models we obtained a high prediction quality for the substrate specificity of the A domains similar to two recently published tools that make use of HMMs or Support Vector Machines (NRPSsp and NRPS predictor2, respectively). Moreover, replacement of the single substrate specific HMMs by ensembles of models caused a clear increase in prediction quality. We argue that the superiority of the ensemble over the single model is caused by the way substrate specificity evolves for the studied systems. It is likely that this also holds true for other protein domains. The ensemble predictor has been implemented in a simple web-based tool that is available at http://www.cmbi.ru.nl/NRPS-PKS-substrate-predictor/.
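
The ensemble idea described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the scores of the individual models in an ensemble are combined, so the rule used here (take the best-scoring member of each substrate's ensemble, then pick the best substrate) is an assumption for illustration only, not the authors' method. The function name `predict_substrate`, the `min_score` cutoff, and all score values are hypothetical; in practice the per-model scores would come from an HMM search tool.

```python
from typing import Dict, List, Tuple


def predict_substrate(
    scores: Dict[str, List[float]],
    min_score: float = 0.0,
) -> Tuple[str, float]:
    """Return (substrate, score) for the best-scoring ensemble.

    `scores` maps each substrate label to the scores that the HMMs in
    that substrate's ensemble assigned to one query domain sequence.
    A query whose best score does not exceed `min_score` is "unknown".
    """
    best_label, best_score = "unknown", min_score
    for label, model_scores in scores.items():
        if not model_scores:
            continue  # an empty ensemble cannot vote
        # Assumption: the ensemble score is its best member's score.
        ensemble_score = max(model_scores)
        if ensemble_score > best_score:
            best_label, best_score = label, ensemble_score
    return best_label, best_score


# Hypothetical scores for one query A domain (not real data).
query_scores = {
    "Ala": [35.2, 12.9, 48.1],
    "Ser": [41.0, 39.5],
    "Gly": [8.3],
}

print(predict_substrate(query_scores))  # ('Ala', 48.1)
```

Taking the maximum lets a substrate be recognized when the query matches any one of several sequence subfamilies, which is one plausible reading of why an ensemble could outperform a single model per substrate. Other combination rules, such as averaging, would be equally easy to substitute.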

Similar articles

Alignment-Free Methods for the Detection and Specificity Prediction of Adenylation Domains.

Methods Mol Biol. 2016;1401:253-72. doi: 10.1007/978-1-4939-3375-4_16.

SBSPKS: structure based sequence analysis of polyketide synthases.

Nucleic Acids Res. 2010 Jul;38(Web Server issue):W487-96. doi: 10.1093/nar/gkq340. Epub 2010 May 5.

Type I pyridoxal 5'-phosphate dependent enzymatic domains embedded within multimodular nonribosomal peptide synthetase and polyketide synthase assembly lines.

BMC Struct Biol. 2013 Oct 23;13:26. doi: 10.1186/1472-6807-13-26.

Type I polyketide synthases that require discrete acyltransferases.

Methods Enzymol. 2009;459:165-86. doi: 10.1016/S0076-6879(09)04608-4.

In silico analysis of methyltransferase domains involved in biosynthesis of secondary metabolites.

BMC Bioinformatics. 2008 Oct 25;9:454. doi: 10.1186/1471-2105-9-454.

Insights into protein-protein and enzyme-substrate interactions in modular polyketide synthases.

Chem Biol. 2010 Jul 30;17(7):705-16. doi: 10.1016/j.chembiol.2010.05.017.

Prediction of the substrate for nonribosomal peptide synthetase (NRPS) adenylation domains by virtual screening.

Proteins. 2015 Nov;83(11):2052-66. doi: 10.1002/prot.24922. Epub 2015 Sep 28.

Specificity prediction of adenylation domains in nonribosomal peptide synthetases (NRPS) using transductive support vector machines (TSVMs).

Nucleic Acids Res. 2005 Oct 12;33(18):5799-808. doi: 10.1093/nar/gki885. Print 2005.

NRPSpredictor2--a web server for predicting NRPS adenylation domain specificity.

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W362-7. doi: 10.1093/nar/gkr323. Epub 2011 May 9.

Cited by

Fatty acyl-AMP ligases in bacterial natural product biosynthesis.

Nat Prod Rep. 2025 Apr 16;42(4):739-753. doi: 10.1039/d4np00073k.

Feature sequence-based genome mining uncovers the hidden diversity of bacterial siderophore pathways.

Elife. 2024 Oct 1;13:RP96719. doi: 10.7554/eLife.96719.

Genome Mining for Diazo-Synthesis-Related Genes in sp. CS057 Unveiled the Cryptic Biosynthetic Gene Cluster for the Novel 3,4-AHBA-Derived Compound Crexazone 2.

Biomolecules. 2024 Aug 29;14(9):1084. doi: 10.3390/biom14091084.

Functional Diversity and Engineering of the Adenylation Domains in Nonribosomal Peptide Synthetases.

Mar Drugs. 2024 Jul 29;22(8):349. doi: 10.3390/md22080349.

Non-ribosomal peptide synthetase (NRPS)-encoding products and their biosynthetic logics in Fusarium.

Microb Cell Fact. 2024 Mar 27;23(1):93. doi: 10.1186/s12934-024-02378-1.

Structural, biochemical and bioinformatic analyses of nonribosomal peptide synthetase adenylation domains.

Nat Prod Rep. 2024 Jul 17;41(7):1180-1205. doi: 10.1039/d3np00064h.

Knowledge-guided data mining on the standardized architecture of NRPS: Subtypes, novel motifs, and sequence entanglements.

PLoS Comput Biol. 2023 May 15;19(5):e1011100. doi: 10.1371/journal.pcbi.1011100. eCollection 2023 May.

Unique Initiation and Termination Mechanisms Involved in the Biosynthesis of a Hybrid Polyketide-Nonribosomal Peptide Lyngbyapeptin B Produced by the Marine Cyanobacterium .

ACS Chem Biol. 2023 Apr 21;18(4):875-883. doi: 10.1021/acschembio.3c00011. Epub 2023 Mar 15.

Bioinformatic Analysis Reveals both Oversampled and Underexplored Biosynthetic Diversity in Nonribosomal Peptides.

ACS Chem Biol. 2023 Mar 17;18(3):476-483. doi: 10.1021/acschembio.2c00761. Epub 2023 Feb 23.

Endofungal bacteria boost anthelminthic host protection with the biosurfactant symbiosin.

Chem Sci. 2022 Nov 21;14(1):103-112. doi: 10.1039/d2sc04167g. eCollection 2022 Dec 21.

References

Anaerobic bacteria as producers of antibiotics.

Appl Microbiol Biotechnol. 2012 Oct;96(1):61-7. doi: 10.1007/s00253-012-4285-8. Epub 2012 Aug 2.

Combinatorial biosynthesis of polyketides--a perspective.

Curr Opin Chem Biol. 2012 Apr;16(1-2):117-23. doi: 10.1016/j.cbpa.2012.01.018. Epub 2012 Feb 16.

Isolation and total synthesis of icumazoles and noricumazoles--antifungal antibiotics and cation-channel blockers from Sorangium cellulosum.

Angew Chem Int Ed Engl. 2012 Jan 27;51(5):1256-60. doi: 10.1002/anie.201106435. Epub 2011 Dec 23.

GenBank.

Nucleic Acids Res. 2012 Jan;40(Database issue):D48-53. doi: 10.1093/nar/gkr1202. Epub 2011 Dec 5.

NRPSsp: non-ribosomal peptide synthase substrate predictor.

Bioinformatics. 2012 Feb 1;28(3):426-7. doi: 10.1093/bioinformatics/btr659. Epub 2011 Nov 29.

Reorganizing the protein space at the Universal Protein Resource (UniProt).

Nucleic Acids Res. 2012 Jan;40(Database issue):D71-5. doi: 10.1093/nar/gkr981. Epub 2011 Nov 18.

Diversity and impact of prokaryotic toxins on aquatic environments: a review.

Toxins (Basel). 2010 Oct;2(10):2359-410. doi: 10.3390/toxins2102359. Epub 2010 Oct 18.

NRPSpredictor2--a web server for predicting NRPS adenylation domain specificity.

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W362-7. doi: 10.1093/nar/gkr323. Epub 2011 May 9.

Cytotoxic tetramic acid derivative produced by a plant type-III polyketide synthase.

J Am Chem Soc. 2011 Apr 6;133(13):4746-9. doi: 10.1021/ja2006737. Epub 2011 Mar 10.

Insights into the complex biosynthesis of the leupyrrins in Sorangium cellulosum So ce690.

Mol Biosyst. 2011 May;7(5):1549-63. doi: 10.1039/c0mb00240b. Epub 2011 Mar 1.
