IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1703-1714. doi: 10.1109/TCBB.2020.3040747. Epub 2022 Jun 3.
Among all the PTMs, the protein phosphorylation is pivotal for various pathological and physiological processes. About 30 percent of eukaryotic proteins undergo the phosphorylation modification, leading to various changes in conformation, function, stability, localization, and so forth. In eukaryotic proteins, phosphorylation occurs on serine (S), Threonine (T) and Tyrosine (Y) residues. Among these all, serine phosphorylation has its own importance as it is associated with various importance biological processes, including energy metabolism, signal transduction pathways, cell cycling, and apoptosis. Thus, its identification is important, however, the in vitro, ex vivo and in vivo identification can be laborious, time-taking and costly. There is a dire need of an efficient and accurate computational model to help researchers and biologists identifying these sites, in an easy manner. Herein, we propose a novel predictor for identification of Phosphoserine sites (PhosS) in proteins, by integrating the Chou's Pseudo Amino Acid Composition (PseAAC) with deep features. We used well-known DNNs for both the tasks of learning a feature representation of peptide sequences and performing classifications. Among different DNNs, the best score is shown by Covolutional Neural Network based model which renders CNN based prediction model the best for Phosphoserine prediction. Based on these results, it is concluded that the proposed model can help to identify PhosS sites in a very efficient and accurate manner which can help scientists understand the mechanism of this modification in proteins.
在所有的翻译后修饰中,蛋白质磷酸化对于各种病理和生理过程至关重要。大约 30%的真核蛋白经历磷酸化修饰,导致构象、功能、稳定性、定位等方面的各种变化。在真核蛋白中,丝氨酸 (S)、苏氨酸 (T) 和酪氨酸 (Y) 残基上发生磷酸化。在所有这些中,丝氨酸磷酸化因其与各种重要的生物过程有关而具有重要意义,包括能量代谢、信号转导途径、细胞周期和细胞凋亡。因此,其鉴定很重要,但是,体外、离体和体内鉴定可能既费力、耗时又昂贵。因此,非常需要一种高效、准确的计算模型来帮助研究人员和生物学家以简单的方式识别这些位点。在此,我们通过整合 Chou 的伪氨基酸组成 (PseAAC) 与深度特征,提出了一种用于鉴定蛋白质中磷酸丝氨酸位点 (PhosS) 的新型预测器。我们使用了著名的 DNN 来学习肽序列的特征表示和执行分类。在不同的 DNN 中,基于卷积神经网络的模型表现出最佳分数,这使得基于卷积神经网络的预测模型成为磷酸丝氨酸预测的最佳模型。基于这些结果,可以得出结论,所提出的模型可以帮助以非常高效和准确的方式识别 PhosS 位点,这有助于科学家理解蛋白质中这种修饰的机制。