• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

催化位点预测中的序列保守性。

Sequence conservation in the prediction of catalytic sites.

机构信息

School of Mathematical Science, Dalian University of Technology, People's Republic of China.

出版信息

Protein J. 2011 Apr;30(4):229-39. doi: 10.1007/s10930-011-9324-2.

DOI:10.1007/s10930-011-9324-2
PMID:21465136
Abstract

Predicting catalytic sites of a given enzyme is an important open problem of Bioinformatics. Recently, many machine learning-based methods have been developed which have the advantage that they can account for many sequential or structural features. We found that although many kinds of features are incorporated, protein sequence conservation is the main part of information they used and should play an important role in the future. So we tested several conservation features in their ability to predict catalytic sites by using the Support Vector Machine classifier. Our results suggest that position specific scoring matrix performs better than other features and incorporating conservation information of sequentially adjacent sites is more effective than that of structurally adjacent ones. Moreover, although conservation information is effective in predicting catalytic sites, it is a difficult problem to optimize the combination of conservation features and other ones.

摘要

预测给定酶的催化位点是生物信息学中的一个重要开放性问题。最近,已经开发了许多基于机器学习的方法,这些方法的优点是可以考虑许多序列或结构特征。我们发现,尽管包含了许多种类的特征,但蛋白质序列保守性是它们使用的信息的主要部分,并且应该在未来发挥重要作用。因此,我们使用支持向量机分类器测试了几种保守特征在预测催化位点方面的能力。我们的结果表明,位置特异性评分矩阵的性能优于其他特征,并且整合序列相邻位点的保守信息比结构相邻位点的保守信息更有效。此外,尽管保守信息在预测催化位点方面很有效,但优化保守特征与其他特征的组合是一个难题。

相似文献

1
Sequence conservation in the prediction of catalytic sites.催化位点预测中的序列保守性。
Protein J. 2011 Apr;30(4):229-39. doi: 10.1007/s10930-011-9324-2.
2
Analysis of evolutionary conservation patterns and their influence on identifying protein functional sites.进化保守模式分析及其对蛋白质功能位点识别的影响。
J Bioinform Comput Biol. 2014 Oct;12(5):1440003. doi: 10.1142/S0219720014400034.
3
Prediction of lipid-binding sites based on support vector machine and position specific scoring matrix.基于支持向量机和位置特异性评分矩阵预测脂质结合位点。
Protein J. 2010 Aug;29(6):427-31. doi: 10.1007/s10930-010-9269-x.
4
Prediction of catalytic residues using Support Vector Machine with selected protein sequence and structural properties.使用支持向量机结合选定的蛋白质序列和结构特性预测催化残基。
BMC Bioinformatics. 2006 Jun 21;7:312. doi: 10.1186/1471-2105-7-312.
5
Prediction of rat protein subcellular localization with pseudo amino acid composition based on multiple sequential features.
Protein Pept Lett. 2011 Jun;18(6):625-33. doi: 10.2174/092986611795222768.
6
Accurate prediction of protein catalytic residues by side chain orientation and residue contact density.通过侧链方向和残基接触密度准确预测蛋白质催化残基。
PLoS One. 2012;7(10):e47951. doi: 10.1371/journal.pone.0047951. Epub 2012 Oct 24.
7
Identification of catalytic residues from protein structure using support vector machine with sequence and structural features.利用具有序列和结构特征的支持向量机从蛋白质结构中鉴定催化残基。
Biochem Biophys Res Commun. 2008 Mar 14;367(3):630-4. doi: 10.1016/j.bbrc.2008.01.038. Epub 2008 Jan 17.
8
Prediction of protein binding sites in protein structures using hidden Markov support vector machine.利用隐马尔可夫支持向量机预测蛋白质结构中的蛋白质结合位点。
BMC Bioinformatics. 2009 Nov 20;10:381. doi: 10.1186/1471-2105-10-381.
9
Predicting protein sumoylation sites from sequence features.预测蛋白质的 SUMO 化位点从序列特征。
Amino Acids. 2012 Jul;43(1):447-55. doi: 10.1007/s00726-011-1100-2. Epub 2011 Oct 7.
10
Prediction of active sites of enzymes by maximum relevance minimum redundancy (mRMR) feature selection.通过最大相关最小冗余(mRMR)特征选择预测酶的活性位点。
Mol Biosyst. 2013 Jan 27;9(1):61-9. doi: 10.1039/c2mb25327e. Epub 2012 Nov 2.

引用本文的文献

1
iCataly-PseAAC: Identification of Enzymes Catalytic Sites Using Sequence Evolution Information with Grey Model GM (2,1).iCataly-PseAAC:基于灰色模型GM(2,1)利用序列进化信息识别酶的催化位点
J Membr Biol. 2015 Dec;248(6):1033-41. doi: 10.1007/s00232-015-9815-8. Epub 2015 Jun 16.
2
A new bioinformatics approach to natural protein collections: permutation structure contrasts of viral and cellular systems.一种新的生物信息学方法用于自然蛋白质集合:病毒和细胞系统的排列结构对比。
Protein J. 2013 Apr;32(4):275-87. doi: 10.1007/s10930-013-9485-2.
3
CLIPS-1D: analysis of multiple sequence alignments to deduce for residue-positions a role in catalysis, ligand-binding, or protein structure.

本文引用的文献

1
Relative von Neumann entropy for evaluating amino acid conservation.用于评估氨基酸保守性的相对冯·诺依曼熵
J Bioinform Comput Biol. 2010 Oct;8(5):809-23. doi: 10.1142/s021972001000494x.
2
CMASA: an accurate algorithm for detecting local protein structural similarity and its application to enzyme catalytic site annotation.CMASA:一种用于检测局部蛋白质结构相似性的精确算法及其在酶催化位点注释中的应用。
BMC Bioinformatics. 2010 Aug 27;11:439. doi: 10.1186/1471-2105-11-439.
3
The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand.
CLIPS-1D:分析多重序列比对,推断残基位置在催化、配体结合或蛋白质结构中的作用。
BMC Bioinformatics. 2012 Apr 5;13:55. doi: 10.1186/1471-2105-13-55.
与从未表达的肽基序相对应的寡脱氧核苷酸序列主要位于非编码链上。
BMC Bioinformatics. 2010 Jul 20;11:383. doi: 10.1186/1471-2105-11-383.
4
Prediction of protein-RNA binding sites by a random forest method with combined features.基于组合特征的随机森林方法预测蛋白质-RNA 结合位点。
Bioinformatics. 2010 Jul 1;26(13):1616-22. doi: 10.1093/bioinformatics/btq253. Epub 2010 May 18.
5
Prediction of catalytic residues based on an overlapping amino acid classification.基于重叠氨基酸分类的催化残基预测。
Amino Acids. 2010 Nov;39(5):1353-61. doi: 10.1007/s00726-010-0587-2. Epub 2010 Apr 10.
6
Automatic prediction of catalytic residues by modeling residue structural neighborhood.基于残基结构邻域建模自动预测催化残基。
BMC Bioinformatics. 2010 Mar 3;11:115. doi: 10.1186/1471-2105-11-115.
7
Active site prediction using evolutionary and structural information.利用进化和结构信息进行活性位点预测。
Bioinformatics. 2010 Mar 1;26(5):617-24. doi: 10.1093/bioinformatics/btq008. Epub 2010 Jan 14.
8
Several appropriate background distributions for entropy-based protein sequence conservation measures.基于熵的蛋白质序列保守性度量的几种适当的背景分布。
J Theor Biol. 2010 Jan 21;262(2):317-22. doi: 10.1016/j.jtbi.2009.09.030. Epub 2009 Oct 4.
9
Prediction of catalytic residues using the variation of stereochemical properties.利用立体化学性质的变化预测催化残基。
Protein J. 2009 Jan;28(1):29-33. doi: 10.1007/s10930-008-9161-0.
10
Using genome-wide protein sequence data to predict amino acid conservation.
Protein J. 2008 Sep;27(6):401-7. doi: 10.1007/s10930-008-9150-3.