• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用THEMATICS和支持向量机提高蛋白质活性位点预测性能。

Enhanced performance in prediction of protein active sites with THEMATICS and support vector machines.

作者信息

Tong Wenxu, Williams Ronald J, Wei Ying, Murga Leonel F, Ko Jaeju, Ondrechen Mary Jo

机构信息

College of Computer and Information Science, Northeastern University, Boston, Massachusetts 02115, USA.

出版信息

Protein Sci. 2008 Feb;17(2):333-41. doi: 10.1110/ps.073213608. Epub 2007 Dec 20.

DOI:10.1110/ps.073213608
PMID:18096640
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2222725/
Abstract

Theoretical microscopic titration curves (THEMATICS) is a computational method for the identification of active sites in proteins through deviations in computed titration behavior of ionizable residues. While the sensitivity to catalytic sites is high, the previously reported sensitivity to catalytic residues was not as high, about 50%. Here THEMATICS is combined with support vector machines (SVM) to improve sensitivity for catalytic residue prediction from protein 3D structure alone. For a test set of 64 proteins taken from the Catalytic Site Atlas (CSA), the average recall rate for annotated catalytic residues is 61%; good precision is maintained selecting only 4% of all residues. The average false positive rate, using the CSA annotations is only 3.2%, far lower than other 3D-structure-based methods. THEMATICS-SVM returns higher precision, lower false positive rate, and better overall performance, compared with other 3D-structure-based methods. Comparison is also made with the latest machine learning methods that are based on both sequence alignments and 3D structures. For annotated sets of well-characterized enzymes, THEMATICS-SVM performance compares very favorably with methods that utilize sequence homology. However, since THEMATICS depends only on the 3D structure of the query protein, no decline in performance is expected when applied to novel folds, proteins with few sequence homologues, or even orphan sequences. An extension of the method to predict non-ionizable catalytic residues is also presented. THEMATICS-SVM predicts a local network of ionizable residues with strong interactions between protonation events; this appears to be a special feature of enzyme active sites.

摘要

理论微观滴定曲线(THEMATICS)是一种通过可电离残基计算滴定行为的偏差来识别蛋白质活性位点的计算方法。虽然对催化位点的敏感性较高,但先前报道的对催化残基的敏感性并不高,约为50%。在此,THEMATICS与支持向量机(SVM)相结合,以提高仅从蛋白质三维结构预测催化残基的敏感性。对于从催化位点图谱(CSA)中选取的64种蛋白质的测试集,注释催化残基的平均召回率为61%;仅选择所有残基的4%时仍能保持良好的精度。使用CSA注释的平均误报率仅为3.2%,远低于其他基于三维结构的方法。与其他基于三维结构的方法相比,THEMATICS-SVM具有更高的精度、更低的误报率和更好的整体性能。还与基于序列比对和三维结构的最新机器学习方法进行了比较。对于注释良好的酶集,THEMATICS-SVM的性能与利用序列同源性的方法相比非常有利。然而,由于THEMATICS仅依赖于查询蛋白质的三维结构,因此应用于新折叠、序列同源物较少的蛋白质甚至孤儿序列时,预计性能不会下降。还介绍了该方法对不可电离催化残基预测的扩展。THEMATICS-SVM预测了可电离残基的局部网络,质子化事件之间存在强烈相互作用;这似乎是酶活性位点的一个特殊特征。

相似文献

1
Enhanced performance in prediction of protein active sites with THEMATICS and support vector machines.利用THEMATICS和支持向量机提高蛋白质活性位点预测性能。
Protein Sci. 2008 Feb;17(2):333-41. doi: 10.1110/ps.073213608. Epub 2007 Dec 20.
2
Computed protonation properties: unique capabilities for protein functional site prediction.计算质子化特性:蛋白质功能位点预测的独特能力。
Genome Inform. 2007;19:107-18.
3
Selective prediction of interaction sites in protein structures with THEMATICS.利用THEMATICS对蛋白质结构中的相互作用位点进行选择性预测。
BMC Bioinformatics. 2007 Apr 9;8:119. doi: 10.1186/1471-2105-8-119.
4
Partial order optimum likelihood (POOL): maximum likelihood prediction of protein active site residues using 3D Structure and sequence properties.偏序最优似然法(POOL):利用三维结构和序列特性对蛋白质活性位点残基进行最大似然预测。
PLoS Comput Biol. 2009 Jan;5(1):e1000266. doi: 10.1371/journal.pcbi.1000266. Epub 2009 Jan 16.
5
Active site prediction for comparative model structures with thematics.具有主题的比较模型结构的活性位点预测。
J Bioinform Comput Biol. 2005 Feb;3(1):127-43. doi: 10.1142/s0219720005000916.
6
High-performance prediction of functional residues in proteins with machine learning and computed input features.基于机器学习和计算输入特征的蛋白质功能残基的高效预测。
Biopolymers. 2011 Jun;95(6):390-400. doi: 10.1002/bip.21589.
7
Prediction of interaction sites from apo 3D structures when the holo conformation is different.当全酶构象不同时,从无配体三维结构预测相互作用位点。
Proteins. 2008 Aug 15;72(3):980-92. doi: 10.1002/prot.21995.
8
Prediction of active sites for protein structures from computed chemical properties.根据计算化学性质预测蛋白质结构的活性位点。
Bioinformatics. 2005 Jun;21 Suppl 1:i258-65. doi: 10.1093/bioinformatics/bti1039.
9
POOL server: machine learning application for functional site prediction in proteins.POOL 服务器:用于蛋白质功能位点预测的机器学习应用程序。
Bioinformatics. 2012 Aug 1;28(15):2078-9. doi: 10.1093/bioinformatics/bts321. Epub 2012 Jun 1.
10
Functional classification of protein 3D structures from predicted local interaction sites.基于预测的局部相互作用位点对蛋白质三维结构进行功能分类。
J Bioinform Comput Biol. 2010 Dec;8 Suppl 1:1-15. doi: 10.1142/s0219720010005166.

引用本文的文献

1
Biochemical Activity of 17 Cancer-Associated Variants of DNA Polymerase Kappa Predicted by Electrostatic Properties.DNA 聚合酶 κ 的 17 种与癌症相关的变异体的静电特性预测的生化活性。
Chem Res Toxicol. 2023 Nov 20;36(11):1789-1803. doi: 10.1021/acs.chemrestox.3c00233. Epub 2023 Oct 26.
2
Integrated Bioinformatics-Based Subtractive Genomics Approach to Decipher the Therapeutic Drug Target and Its Possible Intervention against Brucellosis.基于综合生物信息学的消减基因组学方法来破译治疗药物靶点及其对布鲁氏菌病的可能干预措施。
Bioengineering (Basel). 2022 Nov 1;9(11):633. doi: 10.3390/bioengineering9110633.
3
Probing remote residues important for catalysis in Escherichia coli ornithine transcarbamoylase.探究大肠杆菌鸟氨酸转氨甲酰酶中对催化作用重要的远程残基。
PLoS One. 2020 Feb 6;15(2):e0228487. doi: 10.1371/journal.pone.0228487. eCollection 2020.
4
Tailoring Proteins to Re-Evolve Nature: A Short Review.定制蛋白质以重塑自然:一篇短文综述。
Mol Biotechnol. 2018 Dec;60(12):946-974. doi: 10.1007/s12033-018-0122-3.
5
Sequence-based prediction of physicochemical interactions at protein functional sites using a function-and-interaction-annotated domain profile database.基于序列的蛋白质功能位点物理化学相互作用预测,使用功能和相互作用注释的结构域图谱数据库。
BMC Bioinformatics. 2018 Jun 1;19(1):204. doi: 10.1186/s12859-018-2206-2.
6
Functional classification of protein structures by local structure matching in graph representation.基于图表示的局部结构匹配对蛋白质结构进行功能分类。
Protein Sci. 2018 Jun;27(6):1125-1135. doi: 10.1002/pro.3416. Epub 2018 Apr 27.
7
Annotating Protein Functional Residues by Coupling High-Throughput Fitness Profile and Homologous-Structure Analysis.通过结合高通量适应性图谱和同源结构分析对蛋白质功能残基进行注释
mBio. 2016 Nov 1;7(6):e01801-16. doi: 10.1128/mBio.01801-16.
8
Cutoff lensing: predicting catalytic sites in enzymes.截止透镜法:预测酶中的催化位点。
Sci Rep. 2015 Oct 8;5:14874. doi: 10.1038/srep14874.
9
iCataly-PseAAC: Identification of Enzymes Catalytic Sites Using Sequence Evolution Information with Grey Model GM (2,1).iCataly-PseAAC:基于灰色模型GM(2,1)利用序列进化信息识别酶的催化位点
J Membr Biol. 2015 Dec;248(6):1033-41. doi: 10.1007/s00232-015-9815-8. Epub 2015 Jun 16.
10
A survey of computational intelligence techniques in protein function prediction.蛋白质功能预测中的计算智能技术综述。
Int J Proteomics. 2014;2014:845479. doi: 10.1155/2014/845479. Epub 2014 Dec 11.

本文引用的文献

1
The OPLS [optimized potentials for liquid simulations] potential functions for proteins, energy minimizations for crystals of cyclic peptides and crambin.用于蛋白质的OPLS(液体模拟优化势)势函数、环肽和克拉宾晶体的能量最小化。
J Am Chem Soc. 1988 Mar 1;110(6):1657-66. doi: 10.1021/ja00214a001.
2
Structure-based activity prediction for an enzyme of unknown function.基于结构的未知功能酶活性预测
Nature. 2007 Aug 16;448(7155):775-9. doi: 10.1038/nature05981. Epub 2007 Jul 1.
3
Selective prediction of interaction sites in protein structures with THEMATICS.利用THEMATICS对蛋白质结构中的相互作用位点进行选择性预测。
BMC Bioinformatics. 2007 Apr 9;8:119. doi: 10.1186/1471-2105-8-119.
4
Evaluation of features for catalytic residue prediction in novel folds.新型折叠中催化残基预测特征的评估。
Protein Sci. 2007 Feb;16(2):216-26. doi: 10.1110/ps.062523907. Epub 2006 Dec 22.
5
Prediction of catalytic residues using Support Vector Machine with selected protein sequence and structural properties.使用支持向量机结合选定的蛋白质序列和结构特性预测催化残基。
BMC Bioinformatics. 2006 Jun 21;7:312. doi: 10.1186/1471-2105-7-312.
6
Looking at enzymes from the inside out: the proximity of catalytic residues to the molecular centroid can be used for detection of active sites and enzyme-ligand interfaces.从内而外审视酶:催化残基与分子质心的接近程度可用于检测活性位点和酶-配体界面。
J Mol Biol. 2005 Aug 12;351(2):309-26. doi: 10.1016/j.jmb.2005.06.047.
7
Statistical criteria for the identification of protein active sites using Theoretical Microscopic Titration Curves.使用理论微观滴定曲线识别蛋白质活性位点的统计标准。
Proteins. 2005 May 1;59(2):183-95. doi: 10.1002/prot.20418.
8
Q-SiteFinder: an energy-based method for the prediction of protein-ligand binding sites.Q-SiteFinder:一种基于能量的蛋白质-配体结合位点预测方法。
Bioinformatics. 2005 May 1;21(9):1908-16. doi: 10.1093/bioinformatics/bti315. Epub 2005 Feb 8.
9
Network analysis of protein structures identifies functional residues.蛋白质结构的网络分析可识别功能残基。
J Mol Biol. 2004 Dec 3;344(4):1135-46. doi: 10.1016/j.jmb.2004.10.055.
10
Enzyme/non-enzyme discrimination and prediction of enzyme active site location using charge-based methods.使用基于电荷的方法进行酶/非酶区分以及酶活性位点位置预测。
J Mol Biol. 2004 Jul 2;340(2):263-76. doi: 10.1016/j.jmb.2004.04.070.