Zuo Yong-Chun, Peng Yong, Liu Li, Chen Wei, Yang Lei, Fan Guo-Liang
The Key Laboratory of Mammalian Reproductive Biology and Biotechnology of the Ministry of Education, Inner Mongolia University, Hohhot 010021, China.
Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China.
Anal Biochem. 2014 Aug 1;458:14-9. doi: 10.1016/j.ab.2014.04.032. Epub 2014 May 4.
Peroxidases as universal enzymes are essential for the regulation of reactive oxygen species levels and play major roles in both disease prevention and human pathologies. Automated prediction of functional protein localization is rarely reported and also is important for designing new drugs and drug targets. In this study, we first propose a support vector machine (SVM)-based method to predict peroxidase subcellular localization. Various Chou' pseudo amino acid descriptors and gene ontology (GO)-homology patterns were selected as input features to multiclass SVM. Prediction results showed that the smoothed PSSM encoding pattern performed better than the other approaches. The best overall prediction accuracy was 87.0% in a jackknife test using a PSSM profile of pattern with width=5. We also demonstrate that the present GO annotation is far from complete or deep enough for annotating proteins with a specific function.
过氧化物酶作为通用酶,对于调节活性氧水平至关重要,在疾病预防和人类病理学中都发挥着重要作用。关于功能蛋白定位的自动化预测鲜有报道,但其对于新药和药物靶点的设计也很重要。在本研究中,我们首次提出一种基于支持向量机(SVM)的方法来预测过氧化物酶的亚细胞定位。选择了各种周氏伪氨基酸描述符和基因本体(GO)同源模式作为多类SVM的输入特征。预测结果表明,平滑后的位置特异性得分矩阵(PSSM)编码模式比其他方法表现更好。在使用宽度为5的模式的PSSM图谱进行留一法检验时,最佳总体预测准确率为87.0%。我们还证明,目前的GO注释对于注释具有特定功能的蛋白质来说还远远不够完整或深入。