Department of Computer Science, University of Texas Rio Grande Valley, Edinburg, TX 78531, USA.
Department of Computer and Data Sciences, Case Western Reserve University, Cleveland, OH 44106, USA.
Bioinformatics. 2022 Aug 2;38(15):3785-3793. doi: 10.1093/bioinformatics/btac406.
Protein phosphorylation is a ubiquitous regulatory mechanism that plays a central role in cellular signaling. According to recent estimates, up to 70% of human proteins can be phosphorylated. Therefore, the characterization of phosphorylation dynamics is critical for understanding a broad range of biological and biochemical processes. Technologies based on mass spectrometry are rapidly advancing to meet the needs for high-throughput screening of phosphorylation. These technologies enable untargeted quantification of thousands of phosphorylation sites in a given sample. Many labs are already utilizing these technologies to comprehensively characterize signaling landscapes by examining perturbations with drugs and knockdown approaches, or by assessing diverse phenotypes in cancers, neuro-degerenational diseases, infectious diseases and normal development.
We comprehensively investigate the concept of 'co-phosphorylation' (Co-P), defined as the correlated phosphorylation of a pair of phosphosites across various biological states. We integrate nine publicly available phosphoproteomics datasets for various diseases (including breast cancer, ovarian cancer and Alzheimer's disease) and utilize functional data related to sequence, evolutionary histories, kinase annotations and pathway annotations to investigate the functional relevance of Co-P. Our results across a broad range of studies consistently show that functionally associated sites tend to exhibit significant positive or negative Co-P. Specifically, we show that Co-P can be used to predict with high precision the sites that are on the same pathway or that are targeted by the same kinase. Overall, these results establish Co-P as a useful resource for analyzing phosphoproteins in a network context, which can help extend our knowledge on cellular signaling and its dysregulation.
github.com/msayati/Cophosphorylation. This research used the publicly available datasets published by other researchers as cited in the manuscript.
Supplementary data are available at Bioinformatics online.
蛋白质磷酸化是一种普遍存在的调节机制,在细胞信号转导中起着核心作用。根据最近的估计,多达 70%的人类蛋白质可以被磷酸化。因此,磷酸化动力学的特征对于理解广泛的生物和生化过程至关重要。基于质谱的技术正在迅速发展,以满足高通量筛选磷酸化的需求。这些技术能够在给定的样本中对数千个磷酸化位点进行非靶向定量。许多实验室已经利用这些技术通过用药物和敲除方法进行干扰,或通过评估癌症、神经退行性疾病、传染病和正常发育中的各种表型,全面描述信号转导景观。
我们全面研究了“共磷酸化”(Co-P)的概念,它被定义为在各种生物状态下一对磷酸化位点的相关磷酸化。我们整合了九个公开的磷酸蛋白质组学数据集,用于各种疾病(包括乳腺癌、卵巢癌和阿尔茨海默病),并利用与序列、进化历史、激酶注释和途径注释相关的功能数据来研究 Co-P 的功能相关性。我们在广泛的研究中的结果一致表明,功能相关的位点往往表现出显著的正或负 Co-P。具体来说,我们表明 Co-P 可以用于高精度地预测位于同一途径或被同一激酶靶向的位点。总的来说,这些结果确立了 Co-P 作为在网络背景下分析磷酸蛋白的有用资源,可以帮助扩展我们对细胞信号及其失调的认识。
github.com/msayati/Cophosphorylation。这项研究使用了本文献中引用的其他研究人员发布的公开数据集。
补充数据可在生物信息学在线获取。