Wirbel Jakob, Cutillas Pedro, Saez-Rodriguez Julio
Joint Research Center for Computational Biomedicine (JRC-COMBINE), Faculty of Medicine, RWTH Aachen University, MTZ Pauwelsstrasse 19, D-52074, Aachen, Germany.
Institute for Pharmacy and Molecular Biotechnology (IPMB), University of Heidelberg, 69120, Heidelberg, Germany.
Methods Mol Biol. 2018;1711:103-132. doi: 10.1007/978-1-4939-7493-1_6.
Cellular signaling, predominantly mediated by phosphorylation through protein kinases, is found to be deregulated in most cancers. Accordingly, protein kinases have been subject to intense investigations in cancer research, to understand their role in oncogenesis and to discover new therapeutic targets. Despite great advances, an understanding of kinase dysfunction in cancer is far from complete.A powerful tool to investigate phosphorylation is mass-spectrometry (MS)-based phosphoproteomics, which enables the identification of thousands of phosphorylated peptides in a single experiment. Since every phosphorylation event results from the activity of a protein kinase, high-coverage phosphoproteomics data should indirectly contain comprehensive information about the activity of protein kinases.In this chapter, we discuss the use of computational methods to predict kinase activity scores from MS-based phosphoproteomics data. We start with a short explanation of the fundamental features of the phosphoproteomics data acquisition process from the perspective of the computational analysis. Next, we briefly review the existing databases with experimentally verified kinase-substrate relationships and present a set of bioinformatic tools to discover novel kinase targets. We then introduce different methods to infer kinase activities from phosphoproteomics data and these kinase-substrate relationships. We illustrate their application with a detailed protocol of one of the methods, KSEA (Kinase Substrate Enrichment Analysis). This method is implemented in Python within the framework of the open-source Kinase Activity Toolbox (kinact), which is freely available at http://github.com/saezlab/kinact/ .
细胞信号传导主要由蛋白激酶介导的磷酸化作用调控,而在大多数癌症中发现这种信号传导失调。因此,蛋白激酶一直是癌症研究中的重点研究对象,旨在了解它们在肿瘤发生中的作用并发现新的治疗靶点。尽管取得了巨大进展,但对癌症中激酶功能障碍的理解仍远未完善。一种用于研究磷酸化作用的强大工具是基于质谱(MS)的磷酸化蛋白质组学,它能够在一次实验中鉴定出数千种磷酸化肽段。由于每一个磷酸化事件都是由蛋白激酶的活性导致的,高覆盖率的磷酸化蛋白质组学数据应该间接包含有关蛋白激酶活性的全面信息。在本章中,我们将讨论使用计算方法从基于MS的磷酸化蛋白质组学数据预测激酶活性评分。我们首先从计算分析的角度简要解释磷酸化蛋白质组学数据采集过程的基本特征。接下来,我们简要回顾已有的具有经实验验证的激酶-底物关系的数据库,并介绍一组用于发现新激酶靶点的生物信息学工具。然后,我们介绍从磷酸化蛋白质组学数据以及这些激酶-底物关系推断激酶活性的不同方法。我们通过其中一种方法KSEA(激酶底物富集分析)的详细方案来说明它们的应用。该方法是在开源的激酶活性工具箱(kinact)框架内用Python实现的,可从http://github.com/saezlab/kinact/免费获取。