Ju Zhe, Cao Jun-Zhe, Gu Hong
School of Control Science and Engineering, Dalian University of Technology, #2 Ling-gong Road, Dalian 116024, People׳s Republic of China.
J Theor Biol. 2016 May 21;397:145-50. doi: 10.1016/j.jtbi.2016.02.020. Epub 2016 Feb 22.
As a new type of post-translational modification, lysine phosphoglycerylation plays a key role in regulating glycolytic process and metabolism in cells. Due to the traditional experimental methods are time-consuming and labor-intensive, it is important to develop computational methods to identify the potential phosphoglycerylation sites. However, the prediction performance of the existing phosphoglycerylation site predictor is not satisfactory. In this study, a novel predictor named CKSAAP_PhoglySite is developed to predict phosphoglycerylation sites by using composition of k-spaced amino acid pairs and fuzzy support vector machine. On the one hand, after many aspects of assessments, we find the composition of k-spaced amino acid pairs is more suitable for representing the protein sequence around the phosphoglycerylation sites than other encoding schemes. On the other hand, the proposed fuzzy support vector machine algorithm can effectively handle the imbalanced and noisy problem in phosphoglycerylation sites training dataset. Experimental results indicate that CKSAAP_PhoglySite outperforms the existing phosphoglycerylation site predictor Phogly-PseAAC significantly. A matlab software package for CKSAAP_PhoglySite can be freely downloaded from https://github.com/juzhe1120/Matlab_Software/blob/master/CKSAAP_PhoglySite_Matlab_Software.zip.
作为一种新型的翻译后修饰,赖氨酸磷酸甘油化在调节细胞糖酵解过程和代谢中起着关键作用。由于传统实验方法耗时且费力,开发计算方法来识别潜在的磷酸甘油化位点非常重要。然而,现有磷酸甘油化位点预测器的预测性能并不令人满意。在本研究中,开发了一种名为CKSAAP_PhoglySite的新型预测器,通过使用k间隔氨基酸对组成和模糊支持向量机来预测磷酸甘油化位点。一方面,经过多方面评估,我们发现k间隔氨基酸对组成比其他编码方案更适合表示磷酸甘油化位点周围的蛋白质序列。另一方面,所提出的模糊支持向量机算法可以有效处理磷酸甘油化位点训练数据集中的不平衡和噪声问题。实验结果表明,CKSAAP_PhoglySite明显优于现有的磷酸甘油化位点预测器Phogly-PseAAC。可从https://github.com/juzhe1120/Matlab_Software/blob/master/CKSAAP_PhoglySite_Matlab_Software.zip免费下载用于CKSAAP_PhoglySite的Matlab软件包。