• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ModEnzA:使用具有优化判别阈值和修正发射概率的功能特异性隐马尔可夫模型精确识别代谢酶。

ModEnzA: Accurate Identification of Metabolic Enzymes Using Function Specific Profile HMMs with Optimised Discrimination Threshold and Modified Emission Probabilities.

作者信息

Desai Dhwani K, Nandi Soumyadeep, Srivastava Prashant K, Lynn Andrew M

机构信息

Biological Oceanography Division, Leibniz Institute of Marine Sciences (IFM-GEOMAR), Düsternbrooker Weg 20, 24105 Kiel, Germany.

出版信息

Adv Bioinformatics. 2011;2011:743782. doi: 10.1155/2011/743782. Epub 2011 Mar 29.

DOI:10.1155/2011/743782
PMID:21541071
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3085309/
Abstract

Various enzyme identification protocols involving homology transfer by sequence-sequence or profile-sequence comparisons have been devised which utilise Swiss-Prot sequences associated with EC numbers as the training set. A profile HMM constructed for a particular EC number might select sequences which perform a different enzymatic function due to the presence of certain fold-specific residues which are conserved in enzymes sharing a common fold. We describe a protocol, ModEnzA (HMM-ModE Enzyme Annotation), which generates profile HMMs highly specific at a functional level as defined by the EC numbers by incorporating information from negative training sequences. We enrich the training dataset by mining sequences from the NCBI Non-Redundant database for increased sensitivity. We compare our method with other enzyme identification methods, both for assigning EC numbers to a genome as well as identifying protein sequences associated with an enzymatic activity. We report a sensitivity of 88% and specificity of 95% in identifying EC numbers and annotating enzymatic sequences from the E. coli genome which is higher than any other method. With the next-generation sequencing methods producing a huge amount of sequence data, the development and use of fully automated yet accurate protocols such as ModEnzA is warranted for rapid annotation of newly sequenced genomes and metagenomic sequences.

摘要

已经设计了各种酶识别方案,这些方案通过序列-序列或轮廓-序列比较进行同源性转移,利用与酶委员会(EC)编号相关的瑞士蛋白质数据库(Swiss-Prot)序列作为训练集。为特定EC编号构建的轮廓隐马尔可夫模型(profile HMM)可能会选择由于存在某些在具有共同折叠的酶中保守的折叠特异性残基而执行不同酶功能的序列。我们描述了一种方案,即ModEnzA(HMM-ModE酶注释),它通过纳入来自阴性训练序列的信息,生成在EC编号定义的功能水平上具有高度特异性的轮廓HMM。我们通过从NCBI非冗余数据库挖掘序列来丰富训练数据集,以提高灵敏度。我们将我们的方法与其他酶识别方法进行比较,包括为基因组分配EC编号以及识别与酶活性相关的蛋白质序列。我们报告在识别EC编号和注释大肠杆菌基因组中的酶序列时,灵敏度为88%,特异性为95%,高于任何其他方法。随着下一代测序方法产生大量序列数据,像ModEnzA这样的全自动且准确的方案的开发和使用对于新测序基因组和宏基因组序列的快速注释是必要的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1298/3085309/345d730c303e/ABI2011-743782.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1298/3085309/6c7bdf9fe3db/ABI2011-743782.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1298/3085309/345d730c303e/ABI2011-743782.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1298/3085309/6c7bdf9fe3db/ABI2011-743782.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1298/3085309/345d730c303e/ABI2011-743782.002.jpg

相似文献

1
ModEnzA: Accurate Identification of Metabolic Enzymes Using Function Specific Profile HMMs with Optimised Discrimination Threshold and Modified Emission Probabilities.ModEnzA:使用具有优化判别阈值和修正发射概率的功能特异性隐马尔可夫模型精确识别代谢酶。
Adv Bioinformatics. 2011;2011:743782. doi: 10.1155/2011/743782. Epub 2011 Mar 29.
2
HMM-ModE--improved classification using profile hidden Markov models by optimising the discrimination threshold and modifying emission probabilities with negative training sequences.HMM-ModE——通过优化判别阈值并利用负训练序列修改发射概率,使用轮廓隐马尔可夫模型改进分类。
BMC Bioinformatics. 2007 Mar 27;8:104. doi: 10.1186/1471-2105-8-104.
3
HMM-ModE: implementation, benchmarking and validation with HMMER3.HMM-ModE:使用HMMER3进行实现、基准测试和验证。
BMC Res Notes. 2014 Jul 30;7:483. doi: 10.1186/1756-0500-7-483.
4
Implementation of homology based and non-homology based computational methods for the identification and annotation of orphan enzymes: using Mycobacterium tuberculosis H37Rv as a case study.基于同源性和非同源性的计算方法在孤儿酶的鉴定和注释中的应用:以结核分枝杆菌 H37Rv 为例。
BMC Bioinformatics. 2020 Oct 19;21(1):466. doi: 10.1186/s12859-020-03794-x.
5
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
6
BrEPS: a flexible and automatic protocol to compute enzyme-specific sequence profiles for functional annotation.BrEPS:一种用于计算酶特异性序列轮廓以进行功能注释的灵活自动协议。
BMC Bioinformatics. 2010 Dec 1;11:589. doi: 10.1186/1471-2105-11-589.
7
GenSeed-HMM: A Tool for Progressive Assembly Using Profile HMMs as Seeds and its Application in Alpavirinae Viral Discovery from Metagenomic Data.GenSeed-HMM:一种使用隐马尔可夫模型轮廓作为种子进行渐进式组装的工具及其在从宏基因组数据中发现阿尔帕病毒科病毒中的应用
Front Microbiol. 2016 Mar 4;7:269. doi: 10.3389/fmicb.2016.00269. eCollection 2016.
8
9
Hidden Markov models in computational biology. Applications to protein modeling.计算生物学中的隐马尔可夫模型。在蛋白质建模中的应用。
J Mol Biol. 1994 Feb 4;235(5):1501-31. doi: 10.1006/jmbi.1994.1104.
10
Data mining of enzymes using specific peptides.利用特定肽进行酶的数据挖掘。
BMC Bioinformatics. 2009 Dec 24;10:446. doi: 10.1186/1471-2105-10-446.

引用本文的文献

1
Accurately predicting enzyme functions through geometric graph learning on ESMFold-predicted structures.通过在 ESMFold 预测结构上进行几何图形学习,准确预测酶功能。
Nat Commun. 2024 Sep 18;15(1):8180. doi: 10.1038/s41467-024-52533-w.
2
DeepTM: A deep learning algorithm for prediction of melting temperature of thermophilic proteins directly from sequences.DeepTM:一种直接从序列预测嗜热蛋白解链温度的深度学习算法。
Comput Struct Biotechnol J. 2023 Nov 4;21:5544-5560. doi: 10.1016/j.csbj.2023.11.006. eCollection 2023.
3
Evidential deep learning for trustworthy prediction of enzyme commission number.

本文引用的文献

1
EFICAz2: enzyme function inference by a combined approach enhanced by machine learning.EFICAz2:通过机器学习增强的组合方法进行酶功能推断
BMC Bioinformatics. 2009 Apr 13;10:107. doi: 10.1186/1471-2105-10-107.
2
Application of 'next-generation' sequencing technologies to microbial genetics.“下一代”测序技术在微生物遗传学中的应用。
Nat Rev Microbiol. 2009 Apr;7(4):287-96. doi: 10.1038/nrmicro2122.
3
Quantifying environmental adaptation of metabolic pathways in metagenomics.宏基因组学中代谢途径环境适应性的量化
基于证据的深度学习方法可实现酶委员会编号的可靠预测。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad401.
4
Implementation of homology based and non-homology based computational methods for the identification and annotation of orphan enzymes: using Mycobacterium tuberculosis H37Rv as a case study.基于同源性和非同源性的计算方法在孤儿酶的鉴定和注释中的应用:以结核分枝杆菌 H37Rv 为例。
BMC Bioinformatics. 2020 Oct 19;21(1):466. doi: 10.1186/s12859-020-03794-x.
5
Genomics-driven discovery of a biosynthetic gene cluster required for the synthesis of BII-Rafflesfungin from the fungus Phoma sp. F3723.基于基因组学的发现,一个生物合成基因簇对于真菌 Phoma sp. F3723 合成 BII-Rafflesfungin 是必需的。
BMC Genomics. 2019 May 14;20(1):374. doi: 10.1186/s12864-019-5762-6.
6
DomSign: a top-down annotation pipeline to enlarge enzyme space in the protein universe.DomSign:一种自上而下的注释流程,用于拓展蛋白质世界中的酶空间。
BMC Bioinformatics. 2015 Mar 21;16:96. doi: 10.1186/s12859-015-0499-y.
7
HMM-ModE: implementation, benchmarking and validation with HMMER3.HMM-ModE:使用HMMER3进行实现、基准测试和验证。
BMC Res Notes. 2014 Jul 30;7:483. doi: 10.1186/1756-0500-7-483.
8
Giant hydrogen sulfide plume in the oxygen minimum zone off Peru supports chemolithoautotrophy.秘鲁外氧最小层中巨型硫化氢羽流支持化能自养作用。
PLoS One. 2013 Aug 21;8(8):e68661. doi: 10.1371/journal.pone.0068661. eCollection 2013.
9
Enzyme informatics.酶信息学。
Curr Top Med Chem. 2012;12(17):1911-23. doi: 10.2174/156802612804547353.
10
Factors influencing the diversity of iron uptake systems in aquatic microorganisms.影响水生微生物铁摄取系统多样性的因素。
Front Microbiol. 2012 Oct 18;3:362. doi: 10.3389/fmicb.2012.00362. eCollection 2012.
Proc Natl Acad Sci U S A. 2009 Feb 3;106(5):1374-9. doi: 10.1073/pnas.0808022106. Epub 2009 Jan 22.
4
The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases.代谢途径与酶的MetaCyc数据库以及途径/基因组数据库的BioCyc集合。
Nucleic Acids Res. 2008 Jan;36(Database issue):D623-31. doi: 10.1093/nar/gkm900. Epub 2007 Oct 27.
5
Genome-wide fitness test and mechanism-of-action studies of inhibitory compounds in Candida albicans.白色念珠菌中抑制性化合物的全基因组适应性测试及作用机制研究
PLoS Pathog. 2007 Jun;3(6):e92. doi: 10.1371/journal.ppat.0030092.
6
Lumazine synthase from Candida albicans as an anti-fungal target enzyme: structural and biochemical basis for drug design.白色念珠菌的鲁玛嗪合酶作为抗真菌靶标酶:药物设计的结构和生化基础
J Biol Chem. 2007 Jun 8;282(23):17231-41. doi: 10.1074/jbc.M701724200. Epub 2007 Apr 18.
7
HMM-ModE--improved classification using profile hidden Markov models by optimising the discrimination threshold and modifying emission probabilities with negative training sequences.HMM-ModE——通过优化判别阈值并利用负训练序列修改发射概率,使用轮廓隐马尔可夫模型改进分类。
BMC Bioinformatics. 2007 Mar 27;8:104. doi: 10.1186/1471-2105-8-104.
8
The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific.“魔法师二号”全球海洋采样探险:从西北大西洋到东热带太平洋
PLoS Biol. 2007 Mar;5(3):e77. doi: 10.1371/journal.pbio.0050077.
9
Effects of depleting the essential central metabolic enzyme fructose-1,6-bisphosphate aldolase on the growth and viability of Candida albicans: implications for antifungal drug target discovery.消耗必需的中心代谢酶果糖-1,6-二磷酸醛缩酶对白色念珠菌生长和活力的影响:对抗真菌药物靶点发现的启示
Eukaryot Cell. 2006 Aug;5(8):1371-7. doi: 10.1128/EC.00115-06.
10
Prioritizing genomic drug targets in pathogens: application to Mycobacterium tuberculosis.确定病原体中的基因组药物靶点:在结核分枝杆菌中的应用
PLoS Comput Biol. 2006 Jun 9;2(6):e61. doi: 10.1371/journal.pcbi.0020061.