Prediction of protein structure class by coupling improved genetic algorithm and support vector machine.

Author information

Li Z-C, Zhou X-B, Lin Y-R, Zou X-Y

Affiliation

School of Chemistry and Chemical Engineering, Sun Yat-Sen University, 510275, Guangzhou, People's Republic of China.

Publication information

Amino Acids. 2008 Oct;35(3):581-90. doi: 10.1007/s00726-008-0084-z. Epub 2008 Apr 22.

DOI:10.1007/s00726-008-0084-z

PMID:18427714

Abstract

Structural class characterizes the overall folding type of a protein or its domain. Most existing methods for determining the structural class of a protein rely on a single group of features that carries only one kind of discriminative information for the prediction. Other types of discriminative information associated with the primary sequence are therefore missed entirely, which lowers the prediction success rate. We present a novel method for predicting protein structural class by coupling an improved genetic algorithm (GA) with a support vector machine (SVM). The improved GA was applied both to select an optimized feature subset and to optimize the SVM parameters. Jackknife tests on the working datasets indicated prediction accuracies of 97.8-100% for the individual classes, with an overall accuracy of 99.5%. The results indicate that the approach has high potential to become a useful tool in bioinformatics.
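To make the two ideas in the abstract concrete (GA-driven feature subset selection, scored by a jackknife test), here is a minimal sketch. It is not the authors' implementation. The abstract does not describe the "improved" GA operators or the chromosome layout, so a plain generational GA with one-point crossover, bit-flip mutation, and elitism is assumed. To stay dependency-free, a 1-nearest-neighbor classifier stands in for the SVM, and the SVM parameter optimization mentioned in the abstract is omitted. All function names, the toy data, and the labels are hypothetical.

```python
import random

def nearest_neighbor_predict(train_X, train_y, x):
    """Stand-in classifier (1-NN, squared Euclidean distance); the paper uses an SVM."""
    best = min(range(len(train_X)),
               key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], x)))
    return train_y[best]

def jackknife_accuracy(X, y, mask):
    """Leave-one-out accuracy using only the features where mask[j] == 1."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    Xs = [[row[j] for j in cols] for row in X]
    hits = 0
    for i in range(len(Xs)):
        # Hold out sample i, train on the rest, and test on the held-out sample.
        train_X = Xs[:i] + Xs[i + 1:]
        train_y = y[:i] + y[i + 1:]
        hits += nearest_neighbor_predict(train_X, train_y, Xs[i]) == y[i]
    return hits / len(Xs)

def ga_select_features(X, y, pop_size=12, generations=15, p_mut=0.1, seed=0):
    """Plain generational GA over binary feature masks (assumed operators)."""
    rng = random.Random(seed)
    n = len(X[0])

    def fitness(mask):
        return jackknife_accuracy(X, y, mask)

    # Seed with the all-features mask so the result is never worse than using every feature.
    pop = [[1] * n] + [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size - 1)]
    for _ in range(generations):
        scored = sorted(pop, key=fitness, reverse=True)
        nxt = [scored[0][:]]  # elitism: carry the best mask over unchanged
        while len(nxt) < pop_size:
            a, b = rng.sample(scored[:pop_size // 2], 2)  # parents from the top half
            cut = rng.randrange(1, n)                     # one-point crossover
            child = a[:cut] + b[cut:]
            child = [1 - g if rng.random() < p_mut else g for g in child]  # bit-flip mutation
            nxt.append(child)
        pop = nxt
    best = max(pop, key=fitness)
    return best, fitness(best)

# Hypothetical toy data: two informative features followed by two noise features.
data_rng = random.Random(1)
X, y = [], []
for label, center in (("class_a", 0.0), ("class_b", 3.0)):
    for _ in range(10):
        X.append([center + data_rng.gauss(0, 0.5), center + data_rng.gauss(0, 0.5),
                  data_rng.gauss(0, 5), data_rng.gauss(0, 5)])
        y.append(label)

baseline_acc = jackknife_accuracy(X, y, [1, 1, 1, 1])
best_mask, best_acc = ga_select_features(X, y)
print("all features:", baseline_acc, "| GA-selected mask:", best_mask, "accuracy:", best_acc)
```

Because the initial population contains the all-features mask and the best individual is always carried forward, the returned accuracy cannot fall below the all-features baseline. In a setting closer to the abstract, the chromosome would presumably also encode the SVM parameters, and the fitness function would train an SVM instead of the 1-NN stand-in.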

Similar articles

A high-accuracy protein structural class prediction algorithm using predicted secondary structural information.

J Theor Biol. 2010 Dec 7;267(3):272-5. doi: 10.1016/j.jtbi.2010.09.007. Epub 2010 Sep 8.

Predicting protein structural class by SVM with class-wise optimized features and decision probabilities.

J Theor Biol. 2008 Jul 21;253(2):375-80. doi: 10.1016/j.jtbi.2008.02.031. Epub 2008 Mar 4.

Predicting protein structural class based on multi-features fusion.

J Theor Biol. 2008 Jul 21;253(2):388-92. doi: 10.1016/j.jtbi.2008.03.009. Epub 2008 Mar 14.

Prediction of protein structural class for low-similarity sequences using support vector machine and PSI-BLAST profile.

Biochimie. 2010 Oct;92(10):1330-4. doi: 10.1016/j.biochi.2010.06.013. Epub 2010 Jun 23.

Prediction of protein subcellular localization.

Proteins. 2006 Aug 15;64(3):643-51. doi: 10.1002/prot.21018.

[Protein structural class prediction with binary tree-based support vector machines].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2008 Aug;25(4):921-4.

Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.

Amino Acids. 2009 Jul;37(2):415-25. doi: 10.1007/s00726-008-0170-2. Epub 2008 Aug 23.

Identification of catalytic residues from protein structure using support vector machine with sequence and structural features.

Biochem Biophys Res Commun. 2008 Mar 14;367(3):630-4. doi: 10.1016/j.bbrc.2008.01.038. Epub 2008 Jan 17.

Multi-class support vector machines for protein secondary structure prediction.

Genome Inform. 2003;14:218-27.

Cited by

Support Vector Machines Trained with Evolutionary Algorithms Employing Kernel Adatron for Large Scale Classification of Protein Structures.

Evol Bioinform Online. 2016 Dec 4;12:285-302. doi: 10.4137/EBO.S40912. eCollection 2016.

Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach.

Int J Mol Sci. 2015 Dec 24;17(1):15. doi: 10.3390/ijms17010015.

Customised fragments libraries for protein structure prediction based on structural class annotations.

BMC Bioinformatics. 2015 Apr 29;16(1):136. doi: 10.1186/s12859-015-0576-2.

PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations.

PLoS One. 2014 Mar 27;9(3):e92863. doi: 10.1371/journal.pone.0092863. eCollection 2014.

Proposing a highly accurate protein structural class predictor using segmentation-based features.

BMC Genomics. 2014;15 Suppl 1(Suppl 1):S2. doi: 10.1186/1471-2164-15-S1-S2. Epub 2014 Jan 24.

A strategy to select suitable physicochemical attributes of amino acids for protein fold recognition.

BMC Bioinformatics. 2013 Jul 24;14:233. doi: 10.1186/1471-2105-14-233.

Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm.

BMC Bioinformatics. 2010 Jun 16;11:325. doi: 10.1186/1471-2105-11-325.

Genetic algorithm optimization in drug design QSAR: Bayesian-regularized genetic neural networks (BRGNN) and genetic algorithm-optimized support vectors machines (GA-SVM).

Mol Divers. 2011 Feb;15(1):269-89. doi: 10.1007/s11030-010-9234-9. Epub 2010 Mar 20.

Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences.

BMC Bioinformatics. 2009 Dec 13;10:414. doi: 10.1186/1471-2105-10-414.

Sequence physical properties encode the global organization of protein structure space.

Proc Natl Acad Sci U S A. 2009 Aug 25;106(34):14345-8. doi: 10.1073/pnas.0903433106. Epub 2009 Aug 12.
