Rapid catalytic template searching as an enzyme function prediction procedure.

Affiliation

Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California, United States of America.

Publication information

PLoS One. 2013 May 10;8(5):e62535. doi: 10.1371/journal.pone.0062535. Print 2013.

DOI:10.1371/journal.pone.0062535

PMID:23675414

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3651201/

Abstract

We present an enzyme function identification algorithm, Catalytic Site Identification (CatSId), based on the identification of catalytic residues. The method is optimized for highly accurate template identification across a diverse template library and is also efficient with regard to time and scalability of comparisons. The algorithm matches three-dimensional residue arrangements in a query protein to a library of manually annotated catalytic residues, the Catalytic Site Atlas (CSA). Two main processes are involved. The first is a rapid protein-to-template matching algorithm whose cost scales quadratically with target protein size and linearly with template size. The second incorporates a number of physical descriptors, including binding site predictions, in a logistic scoring procedure to re-score the matches found in the first process. This approach performs very well overall, with a receiver operating characteristic (ROC) area under the curve (AUC) of 0.971 for the training set evaluated. The procedure handles cofactors, ions, nonstandard residues, and point substitutions for residues and ions in a robust and integrated fashion. Sites with only two critical (catalytic) residues are challenging cases, with AUCs of 0.9411 and 0.5413 for the training and test sets, respectively. For templates with more than two critical (catalytic) residues, performance is excellent, with AUCs greater than 0.90 for both the training and test data. The procedure has considerable promise for larger-scale searches.
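
The two-stage idea in the abstract (geometric template matching, then logistic re-scoring evaluated by AUC) can be illustrated with a small sketch. This is not the CatSId implementation: the function names, the distance tolerance, the feature vector, and the weights are all hypothetical, and the naive backtracking search below does not reproduce the scaling behavior reported for the published algorithm. Matching on pairwise distances alone also cannot tell a site from its mirror image.

```python
import math

def match_template(query, template, tol=1.0):
    """Find query residue sets whose types and pairwise distances fit the template.

    query, template: lists of (residue_type, (x, y, z)).
    Returns a list of tuples of query indices, one index per template residue.
    tol is an assumed distance tolerance in the same units as the coordinates.
    """
    matches = []

    def extend(partial):
        k = len(partial)
        if k == len(template):
            matches.append(tuple(partial))
            return
        t_type, t_xyz = template[k]
        for i, (q_type, q_xyz) in enumerate(query):
            if q_type != t_type or i in partial:
                continue
            # Each distance to an already placed residue must agree with
            # the corresponding template distance within the tolerance.
            if all(
                abs(math.dist(q_xyz, query[j][1])
                    - math.dist(t_xyz, template[m][1])) <= tol
                for m, j in enumerate(partial)
            ):
                extend(partial + [i])

    extend([])
    return matches

def logistic_score(features, weights, bias):
    """Map a match's descriptor values to a probability-like score in (0, 1)."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))

def auc(pos_scores, neg_scores):
    """ROC AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties count as half a correct ranking.
    """
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))

# Hypothetical three-residue template and a small query "protein".
template = [("HIS", (0.0, 0.0, 0.0)), ("ASP", (3.0, 0.0, 0.0)), ("SER", (0.0, 4.0, 0.0))]
query = [
    ("GLY", (10.0, 10.0, 10.0)),
    ("HIS", (1.0, 1.0, 1.0)),
    ("ASP", (4.0, 1.0, 1.0)),
    ("SER", (1.0, 5.0, 1.0)),
    ("SER", (20.0, 0.0, 0.0)),
]
hits = match_template(query, template)
```

In a fuller pipeline, each entry of `hits` would be turned into a descriptor vector (for example, a fit error and a binding-site overlap measure, both assumed here), passed through `logistic_score` with fitted weights, and the resulting scores for true and false matches compared with `auc`.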

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a29d/3651201/4c82aeefbada/pone.0062535.g001.jpg
