Department of Chemistry, York College of the City University of New York, Jamaica, New York, United States of America; Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, Rockefeller University, New York, New York, United States of America.
PLoS One. 2007 Aug 1;2(7):e656. doi: 10.1371/journal.pone.0000656.
Protein phosphorylation, mediated by a family of enzymes called cyclin-dependent kinases (Cdks), plays a central role in the cell-division cycle of eukaryotes. Phosphorylation by Cdks directs the cell cycle by modifying the function of regulators of key processes such as DNA replication and mitotic progression. Here, we present a novel computational procedure to predict substrates of the cyclin-dependent kinase Cdc28 (Cdk1) in the Saccharomyces cerevisiae. Currently, most computational phosphorylation site prediction procedures focus solely on local sequence characteristics. In the present procedure, we model Cdk substrates based on both local and global characteristics of the substrates. Thus, we define the local sequence motifs that represent the Cdc28 phosphorylation sites and subsequently model clustering of these motifs within the protein sequences. This restraint reflects the observation that many known Cdk substrates contain multiple clustered phosphorylation sites. The present strategy defines a subset of the proteome that is highly enriched for Cdk substrates, as validated by comparing it to a set of bona fide, published, experimentally characterized Cdk substrates which was to our knowledge, comprehensive at the time of writing. To corroborate our model, we compared its predictions with three experimentally independent Cdk proteomic datasets and found significant overlap. Finally, we directly detected in vivo phosphorylation at Cdk motifs for selected putative substrates using mass spectrometry.
蛋白质磷酸化是由一类称为细胞周期蛋白依赖性激酶(Cdks)的酶介导的,在真核生物的细胞分裂周期中起着核心作用。Cdks 的磷酸化通过修饰 DNA 复制和有丝分裂进程等关键过程的调节剂的功能来指导细胞周期。在这里,我们提出了一种新的计算程序,用于预测酿酒酵母中的细胞周期蛋白依赖性激酶 Cdc28(Cdk1)的底物。目前,大多数计算磷酸化位点预测程序仅专注于局部序列特征。在本程序中,我们基于底物的局部和全局特征来构建 Cdk 底物模型。因此,我们定义了代表 Cdc28 磷酸化位点的局部序列基序,随后在蛋白质序列中对这些基序进行聚类建模。这种约束反映了这样一种观察结果,即许多已知的 Cdk 底物包含多个聚集的磷酸化位点。本策略定义了一个高度富含 Cdk 底物的蛋白质组子集,通过与一组公认的、已发表的、经过实验表征的 Cdk 底物进行比较来验证,该数据集在撰写本文时是全面的。为了证实我们的模型,我们将其预测与三个独立的实验 Cdk 蛋白质组数据集进行了比较,发现有显著的重叠。最后,我们使用质谱法直接检测了选定假定底物上 Cdk 基序的体内磷酸化。