DeepPPSite: A deep learning-based model for analysis and prediction of phosphorylation sites using efficient sequence information.

Affiliations

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China.

School of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China.

Publication information

Anal Biochem. 2021 Jan 1;612:113955. doi: 10.1016/j.ab.2020.113955. Epub 2020 Sep 16.

DOI:10.1016/j.ab.2020.113955
PMID:32949607
Abstract

Phosphorylation is a ubiquitous type of post-translational modification (PTM) that occurs in both eukaryotic and prokaryotic cells, in which a phosphate group binds to amino acid residues. These specific residues, i.e., serine (S), threonine (T), and tyrosine (Y), exhibit diverse functions at the molecular level. Recent studies have determined that some diseases, such as cancer, diabetes, and neurodegenerative diseases, are caused by abnormal phosphorylation. Because of its potential applications in biological research and drug development, the large-scale identification of phosphorylation sites has attracted interest. Existing wet-lab technologies for targeting phosphorylation sites are expensive and time-consuming. Thus, computational algorithms that can efficiently accelerate the annotation of phosphorylation sites from massive protein sequences are needed. Numerous machine learning-based methods have been implemented for phosphorylation site prediction. However, despite extensive efforts, existing computational approaches continue to have inadequate performance, particularly in terms of overall accuracy (ACC), Matthews correlation coefficient (MCC), and area under the ROC curve (AUC). In this paper, we report DeepPPSite, a novel deep learning-based predictor designed to overcome these performance hurdles, which was constructed using a stacked long short-term memory (LSTM) recurrent network for predicting phosphorylation sites. The proposed technique expediently learns protein representations from conjoint protein descriptors. The experimental results indicated that our model achieved superior performance on the training dataset for S, T, and Y, with MCC values of 0.608, 0.602, and 0.558, respectively, using a 10-fold cross-validation test. We further determined the generalization efficacy of the proposed predictor DeepPPSite by conducting a rigorous independent test. The predictive MCC values were 0.358, 0.356, and 0.350 for the S, T, and Y phosphorylation sites, respectively. Rigorous cross-validation and independent validation tests for the three types of phosphorylation sites demonstrated that the designed DeepPPSite tool significantly outperforms state-of-the-art methods.
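The abstract reports its headline results as MCC values. As a minimal sketch of how that metric is computed for a binary site classifier (1 = phosphorylated, 0 = not), the snippet below applies the standard formula MCC = (TP·TN − FP·FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)). The function names and the toy labels are assumptions made for illustration only; they are not taken from the DeepPPSite code or data.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels (1 = positive, 0 = negative)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when the denominator is zero (a common convention when one
    row or column of the confusion matrix is empty).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Toy labels, invented for this example (not results from the paper).
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 1]

tp, tn, fp, fn = confusion_counts(y_true, y_pred)
print(matthews_corrcoef(tp, tn, fp, fn))  # prints 0.5
```

MCC ranges from -1 to 1 and uses all four cells of the confusion matrix, so it stays informative when positive and negative sites are imbalanced, which is one reason it is commonly reported alongside ACC and AUC.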

Similar articles

1
DeepPPSite: A deep learning-based model for analysis and prediction of phosphorylation sites using efficient sequence information.
Anal Biochem. 2021 Jan 1;612:113955. doi: 10.1016/j.ab.2020.113955. Epub 2020 Sep 16.
2
Boosting phosphorylation site prediction with sequence feature-based machine learning.
Proteins. 2020 Feb;88(2):284-291. doi: 10.1002/prot.25801. Epub 2019 Aug 22.
3
DeepSSPred: A Deep Learning Based Sulfenylation Site Predictor Via a Novel nSegmented Optimize Federated Feature Encoder.
Protein Pept Lett. 2021;28(6):708-721. doi: 10.2174/0929866527666201202103411.
4
Prediction of phosphorylation sites based on Krawtchouk image moments.
Proteins. 2017 Dec;85(12):2231-2238. doi: 10.1002/prot.25388. Epub 2017 Sep 29.
5
A novel sequence-based method for phosphorylation site prediction with feature selection and analysis.
Protein Pept Lett. 2012 Jan;19(1):70-8. doi: 10.2174/092986612798472893.
6
Predicting protein phosphorylation sites in soybean using interpretable deep tabular learning network.
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac015.
7
Predicting phosphorylation sites using machine learning by integrating the sequence, structure, and functional information of proteins.
J Transl Med. 2021 May 24;19(1):218. doi: 10.1186/s12967-021-02851-0.
8
Prediction of serine phosphorylation sites mapping on Schizosaccharomyces Pombe by fusing three encoding schemes with the random forest classifier.
Sci Rep. 2022 Feb 16;12(1):2632. doi: 10.1038/s41598-022-06529-5.
9
A novel method for predicting post-translational modifications on serine and threonine sites by using site-modification network profiles.
Mol Biosyst. 2015 Nov;11(11):3092-100. doi: 10.1039/c5mb00384a.
10
Optimization of serine phosphorylation prediction in proteins by comparing human engineered features and deep representations.
Anal Biochem. 2021 Feb 15;615:114069. doi: 10.1016/j.ab.2020.114069. Epub 2020 Dec 16.

Cited by

1
An efficient machine-learning framework for predicting protein post-translational modification sites.
Sci Rep. 2025 Aug 25;15(1):31179. doi: 10.1038/s41598-025-13178-x.
2
GraphPhos: Predict Protein-Phosphorylation Sites Based on Graph Neural Networks.
Int J Mol Sci. 2025 Jan 23;26(3):941. doi: 10.3390/ijms26030941.
3
A Computational Predictor for Accurate Identification of Tumor Homing Peptides by Integrating Sequential and Deep BiLSTM Features.
Interdiscip Sci. 2024 Jun;16(2):503-518. doi: 10.1007/s12539-024-00628-9. Epub 2024 May 11.
4
H2Opred: a robust and efficient hybrid deep learning model for predicting 2'-O-methylation sites in human RNA.
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad476.
5
A Review of Machine Learning and Algorithmic Methods for Protein Phosphorylation Site Prediction.
Genomics Proteomics Bioinformatics. 2023 Dec;21(6):1266-1285. doi: 10.1016/j.gpb.2023.03.007. Epub 2023 Oct 19.
6
Deep Learning in Phosphoproteomics: Methods and Application in Cancer Drug Discovery.
Proteomes. 2023 May 2;11(2):16. doi: 10.3390/proteomes11020016.
7
Mini-review: Recent advances in post-translational modification site prediction based on deep learning.
Comput Struct Biotechnol J. 2022 Jun 30;20:3522-3532. doi: 10.1016/j.csbj.2022.06.045. eCollection 2022.
8
A hybrid feature extraction scheme for efficient malonylation site prediction.
Sci Rep. 2022 Apr 6;12(1):5756. doi: 10.1038/s41598-022-08555-9.
9
A Transfer-Learning-Based Deep Convolutional Neural Network for Predicting Leukemia-Related Phosphorylation Sites from Protein Primary Sequences.
Int J Mol Sci. 2022 Feb 3;23(3):1741. doi: 10.3390/ijms23031741.
10
PScL-HDeep: image-based prediction of protein subcellular location in human tissue using ensemble learning of handcrafted and deep learned features with two-layer feature selection.
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab278.