Predicting protein structural class based on multi-features fusion.

Authors

Chen Chao, Chen Li-Xuan, Zou Xiao-Yong, Cai Pei-Xiang

Affiliation

School of Traditional Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou 510006, PR China.

Publication information

J Theor Biol. 2008 Jul 21;253(2):388-92. doi: 10.1016/j.jtbi.2008.03.009. Epub 2008 Mar 14.

DOI:10.1016/j.jtbi.2008.03.009

PMID:18423494

Abstract

Structural class characterizes the overall folding type of a protein or its domain, and predicting it has become an important and challenging topic in protein science. Moreover, work on this problem can stimulate the development of novel predictors that may be applied directly to many other related areas. In this paper, 10 frequently used sequence-derived structural and physicochemical features, which can be easily computed by the PROFEAT (Protein Features) web server, were used as inputs to support vector machines to develop statistical learning models for predicting protein structural class. More importantly, a strategy for merging different features, called best-first search, was developed. The rigorous jackknife cross-validation test showed that the success rates of our method were significantly improved. We anticipate that the present method may also help boost predictive accuracy for a series of other protein attributes, such as subcellular localization, membrane types, and enzyme family and subfamily classes, among many others.
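The abstract names two techniques without giving details: a best-first search for merging feature groups, and a jackknife (leave-one-out) test for scoring each candidate. The following is a minimal sketch of how the two could fit together, not the authors' implementation. Several things are assumptions made for illustration: the function names, the `patience` stopping rule, the toy data, and in particular the classifier, which is a 1-nearest-neighbour stand-in so that the example runs with the standard library only. The paper itself uses support vector machines; a real SVM (for example scikit-learn's `sklearn.svm.SVC`) could be passed in through the `predict` argument.

```python
import heapq
from itertools import count


def nearest_neighbour_predict(train_X, train_y, x):
    """Stand-in classifier (1-NN, squared Euclidean distance).

    ASSUMPTION: used only to keep the sketch dependency-free;
    the paper's models are support vector machines.
    """
    nearest = min(
        range(len(train_X)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], x)),
    )
    return train_y[nearest]


def jackknife_accuracy(X, y, predict=nearest_neighbour_predict):
    """Leave-one-out test: predict each sample from all the others."""
    hits = 0
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        hits += predict(train_X, train_y, X[i]) == y[i]
    return hits / len(X)


def fuse(groups, names):
    """Concatenate the chosen feature groups, sample by sample."""
    n_samples = len(next(iter(groups.values())))
    return [[v for name in names for v in groups[name][i]]
            for i in range(n_samples)]


def best_first_search(groups, y, patience=3):
    """Best-first search over subsets of feature groups.

    Repeatedly expands the best-scoring unexpanded subset by adding one
    more group. Stops when the open list is empty or after `patience`
    consecutive expansions without improvement (hypothetical rule).
    """
    tie = count()                       # tiebreaker so subsets are never compared
    open_heap = [(0.0, next(tie), frozenset())]
    seen = {frozenset()}
    best_subset, best_score = frozenset(), 0.0
    stale = 0
    while open_heap and stale < patience:
        _, _, subset = heapq.heappop(open_heap)
        improved = False
        for name in groups:
            child = subset | {name}
            if name in subset or child in seen:
                continue
            seen.add(child)
            score = jackknife_accuracy(fuse(groups, sorted(child)), y)
            heapq.heappush(open_heap, (-score, next(tie), child))
            if score > best_score:
                best_subset, best_score, improved = child, score, True
        stale = 0 if improved else stale + 1
    return sorted(best_subset), best_score


# Toy data (made up): one informative group and one uninformative group.
y = ["alpha", "alpha", "alpha", "beta", "beta", "beta"]
groups = {
    "composition": [[0.0], [0.1], [0.2], [1.0], [1.1], [1.2]],
    "noise": [[5.0], [0.0], [5.0], [0.0], [5.0], [0.0]],
}
selected, score = best_first_search(groups, y)
print(selected, score)
```

Keeping an open list of every evaluated subset, rather than only the current best, is what distinguishes best-first search from plain greedy forward selection: the search can return to an earlier subset when the most recent expansion turns out to be a dead end.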

Similar articles

Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.

Amino Acids. 2009 Jul;37(2):415-25. doi: 10.1007/s00726-008-0170-2. Epub 2008 Aug 23.

Using pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes.

Amino Acids. 2007 Nov;33(4):623-9. doi: 10.1007/s00726-007-0496-1. Epub 2007 Feb 19.

Prediction of protein structure class by coupling improved genetic algorithm and support vector machine.

Amino Acids. 2008 Oct;35(3):581-90. doi: 10.1007/s00726-008-0084-z. Epub 2008 Apr 22.

Computer prediction of allergen proteins from sequence-derived protein structural and physicochemical properties.

Mol Immunol. 2007 Jan;44(4):514-20. doi: 10.1016/j.molimm.2006.02.010. Epub 2006 Mar 23.

Boosting classifier for predicting protein domain structural class.

Biochem Biophys Res Commun. 2005 Aug 19;334(1):213-7. doi: 10.1016/j.bbrc.2005.06.075.

MODEL-molecular descriptor lab: a web-based server for computing structural and physicochemical features of compounds.

Biotechnol Bioeng. 2007 Jun 1;97(2):389-96. doi: 10.1002/bit.21214.

Using supervised fuzzy clustering to predict protein structural classes.

Biochem Biophys Res Commun. 2005 Aug 26;334(2):577-81. doi: 10.1016/j.bbrc.2005.06.128.

A machine learning based method for the prediction of secretory proteins using amino acid composition, their order and similarity-search.

In Silico Biol. 2008;8(2):129-40.

[Protein structural class prediction with binary tree-based support vector machines].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2008 Aug;25(4):921-4.

Cited by

A Review for Artificial Intelligence Based Protein Subcellular Localization.

Biomolecules. 2024 Mar 27;14(4):409. doi: 10.3390/biom14040409.

Comparative Study on Feature Selection in Protein Structure and Function Prediction.

Comput Math Methods Med. 2022 Oct 11;2022:1650693. doi: 10.1155/2022/1650693. eCollection 2022.

Prediction of disease-associated nsSNPs by integrating multi-scale ResNet models with deep feature fusion.

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab530.

Using Recursive Feature Selection with Random Forest to Improve Protein Structural Class Prediction for Low-Similarity Sequences.

Comput Math Methods Med. 2021 May 7;2021:5529389. doi: 10.1155/2021/5529389. eCollection 2021.

Prediction of protein structural classes by different feature expressions based on 2-D wavelet denoising and fusion.

BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):701. doi: 10.1186/s12859-019-3276-5.

A multi-label classifier for predicting the subcellular localization of gram-negative bacterial proteins with both single and multiple sites.

PLoS One. 2011;6(6):e20592. doi: 10.1371/journal.pone.0020592. Epub 2011 Jun 17.

Some remarks on protein attribute prediction and pseudo amino acid composition.

J Theor Biol. 2011 Mar 21;273(1):236-47. doi: 10.1016/j.jtbi.2010.12.024. Epub 2010 Dec 17.

Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm.

BMC Bioinformatics. 2010 Jun 16;11:325. doi: 10.1186/1471-2105-11-325.

Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences.

BMC Bioinformatics. 2009 Dec 13;10:414. doi: 10.1186/1471-2105-10-414.

Protein domain boundary predictions: a structural biology perspective.

Open Biochem J. 2009;3:1-8. doi: 10.2174/1874091X00903010001. Epub 2009 Jan 21.
