Cole Stephen, Prabakaran Sudhakaran
Department of Genetics, University of Cambridge, Downing Site, Cambridge CB2 3EH, UK.
Department of Genetics, University of Cambridge, Downing Site, Cambridge CB2 3EH, UK.
iScience. 2020 Aug 21;23(8):101321. doi: 10.1016/j.isci.2020.101321. Epub 2020 Jul 1.
Phosphorylation sites often have key regulatory functions and are central to many cellular signaling pathways, so mutations that modify them have the potential to contribute to pathological states such as cancer. Although many classifiers exist for prioritization of coding genomic variants, to our knowledge none of them explicitly account for the alteration or creation of kinase recognition motifs that alter protein structure, function, regulation of activity, and interaction networks through modifying the pattern of phosphorylation. We present a novel computational pipeline that uses a random forest classifier to predict the pathogenicity of a variant, according to its direct or indirect effect on local phosphorylation sites and the predicted functional impact of perturbing a phosphorylation event. We call this classifier PhosphoEffect and find that it compares favorably and with increased accuracy to the existing classifier PolyPhen 2.2.2 when tested on a dataset of known variants enriched for phosphorylation sites and their neighbors.
磷酸化位点通常具有关键的调节功能,并且是许多细胞信号通路的核心,因此改变它们的突变有可能导致诸如癌症等病理状态。尽管存在许多用于对编码基因组变异进行优先级排序的分类器,但据我们所知,它们中没有一个明确考虑激酶识别基序的改变或创建,这些基序会通过改变磷酸化模式来改变蛋白质结构、功能、活性调节和相互作用网络。我们提出了一种新颖的计算流程,该流程使用随机森林分类器根据变异对局部磷酸化位点的直接或间接影响以及干扰磷酸化事件的预测功能影响来预测变异的致病性。我们将此分类器称为PhosphoEffect,并发现当在富含磷酸化位点及其邻近区域的已知变异数据集上进行测试时,它与现有的分类器PolyPhen 2.2.2相比具有优势且准确性更高。