Xue Yu, Ren Jian, Gao Xinjiao, Jin Changjiang, Wen Longping, Yao Xuebiao
Hefei National Laboratory for Physical Sciences at Microscale and School of Life Sciences, University of Science and Technology of China, Hefei, Anhui 230027, China.
Mol Cell Proteomics. 2008 Sep;7(9):1598-608. doi: 10.1074/mcp.M700574-MCP200. Epub 2008 May 6.
Identification of protein phosphorylation sites with their cognate protein kinases (PKs) is a key step to delineate molecular dynamics and plasticity underlying a variety of cellular processes. Although nearly 10 kinase-specific prediction programs have been developed, numerous PKs have been casually classified into subgroups without a standard rule. For large scale predictions, the false positive rate has also never been addressed. In this work, we adopted a well established rule to classify PKs into a hierarchical structure with four levels, including group, family, subfamily, and single PK. In addition, we developed a simple approach to estimate the theoretically maximal false positive rates. The on-line service and local packages of the GPS (Group-based Prediction System) 2.0 were implemented in Java with the modified version of the Group-based Phosphorylation Scoring algorithm. As the first stand alone software for predicting phosphorylation, GPS 2.0 can predict kinase-specific phosphorylation sites for 408 human PKs in hierarchy. A large scale prediction of more than 13,000 mammalian phosphorylation sites by GPS 2.0 was exhibited with great performance and remarkable accuracy. Using Aurora-B as an example, we also conducted a proteome-wide search and provided systematic prediction of Aurora-B-specific substrates including protein-protein interaction information. Thus, the GPS 2.0 is a useful tool for predicting protein phosphorylation sites and their cognate kinases and is freely available on line.
鉴定蛋白质磷酸化位点及其相关蛋白激酶(PKs)是描绘多种细胞过程背后分子动力学和可塑性的关键步骤。尽管已经开发了近10种激酶特异性预测程序,但众多PKs被随意分类到亚组中,没有标准规则。对于大规模预测,误报率也从未得到解决。在这项工作中,我们采用了一个成熟的规则,将PKs分类为具有四个层次的层次结构,包括组、家族、亚家族和单个PK。此外,我们开发了一种简单的方法来估计理论上的最大误报率。GPS(基于组的预测系统)2.0的在线服务和本地包用Java实现,并采用了基于组的磷酸化评分算法的修改版本。作为第一个用于预测磷酸化的独立软件,GPS 2.0可以分层预测408种人类PKs的激酶特异性磷酸化位点。GPS 2.0对超过13000个哺乳动物磷酸化位点进行了大规模预测,表现出了出色的性能和显著的准确性。以Aurora-B为例,我们还进行了全蛋白质组搜索,并提供了包括蛋白质-蛋白质相互作用信息在内的Aurora-B特异性底物的系统预测。因此,GPS 2.0是预测蛋白质磷酸化位点及其相关激酶的有用工具,可在线免费获取。