Lussana Alessandro, Müller-Dott Sophia, Saez-Rodriguez Julio, Petsalaki Evangelia
European Bioinformatics Institute, European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom.
Faculty of Medicine and Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg University, Heidelberg 69120, Germany.
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae697.
The inference of kinase activity from phosphoproteomics data can point to causal mechanisms driving signalling processes and potential drug targets. Identifying the kinases whose change in activity explains the observed phosphorylation profiles, however, remains challenging, and constrained by the manually curated knowledge of kinase-substrate associations. Recently, experimentally determined substrate sequence specificities of human kinases have become available, but robust methods to exploit this new data for kinase activity inference are still missing. We present PhosX, a method to estimate differential kinase activity from phosphoproteomics data that combines state-of-the-art statistics in enrichment analysis with kinases' substrate sequence specificity information. Using a large phosphoproteomics dataset with known differentially regulated kinases we show that our method identifies upregulated and downregulated kinases by only relying on the input phosphopeptides' sequences and intensity changes. We find that PhosX outperforms the currently available approach for the same task, and performs better or similarly to state-of-the-art methods that rely on previously known kinase-substrate associations. We therefore recommend its use for data-driven kinase activity inference.
PhosX is implemented in Python, open-source under the Apache-2.0 licence, and distributed on the Python Package Index. The code is available on GitHub (https://github.com/alussana/phosx).
从磷酸化蛋白质组学数据推断激酶活性可以揭示驱动信号传导过程的因果机制和潜在的药物靶点。然而,确定其活性变化能够解释观察到的磷酸化谱的激酶仍然具有挑战性,并且受到激酶-底物关联的人工整理知识的限制。最近,已获得实验确定的人类激酶的底物序列特异性,但仍缺乏利用这些新数据进行激酶活性推断的可靠方法。我们提出了PhosX,一种从磷酸化蛋白质组学数据估计差异激酶活性的方法,该方法将富集分析中的最新统计数据与激酶的底物序列特异性信息相结合。使用一个具有已知差异调节激酶的大型磷酸化蛋白质组学数据集,我们表明我们的方法仅通过依赖输入磷酸肽的序列和强度变化就能识别上调和下调的激酶。我们发现PhosX在执行相同任务时优于当前可用的方法,并且与依赖先前已知激酶-底物关联的最新方法表现相当或更好。因此,我们建议将其用于数据驱动的激酶活性推断。
PhosX用Python实现,根据Apache-2.0许可开源,并在Python包索引上分发。代码可在GitHub(https://github.com/alussana/phosx)上获取。