National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi 110067, India.
Bioinformatics. 2010 Jan 15;26(2):189-97. doi: 10.1093/bioinformatics/btp633. Epub 2009 Nov 12.
In silico methods are being widely used for identifying substrates for various kinases and deciphering cell signaling networks. However, most of the available phosphorylation site prediction methods use motifs or profiles derived from a known data set of kinase substrates and hence, their applicability is limited to only those kinase families for which experimental substrate data is available. This prompted us to develop a novel multi-scale structure-based approach which does not require training using experimental substrate data.
In this work, for the first time, we have used residue-based statistical pair potentials for scoring the binding energy of various substrate peptides in complex with kinases. Extensive benchmarking on Phospho.ELM data set indicate that our method outperforms other structure-based methods and has a prediction accuracy comparable to available sequence-based methods. We also demonstrate that the rank of the true substrate can be further improved, if the high-scoring candidate substrates that are short-listed based on pair potential score, are modeled using all atom forcefield and MM/PBSA approach.
Supplementary data are available at Bioinformatics Online.
计算方法被广泛用于鉴定各种激酶的底物,并破译细胞信号网络。然而,大多数现有的磷酸化位点预测方法使用来自已知激酶底物数据集的基序或模式,因此,它们的适用性仅限于那些具有实验底物数据的激酶家族。这促使我们开发了一种新的基于多尺度结构的方法,该方法不需要使用实验底物数据进行训练。
在这项工作中,我们首次使用基于残基的统计对势来对与激酶结合的各种底物肽的结合能进行评分。在 Phospho.ELM 数据集上的广泛基准测试表明,我们的方法优于其他基于结构的方法,并且具有与可用基于序列的方法相当的预测准确性。我们还证明,如果基于对势得分筛选出的高分候选底物使用全原子力场和 MM/PBSA 方法进行建模,则真实底物的排名可以进一步提高。
补充数据可在生物信息学在线获得。