• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估新的翻译后修饰位点的预测因子:以羟基化为例的案例研究。

Assessing predictors for new post translational modification sites: A case study on hydroxylation.

机构信息

Department of Biomedical Sciences, University of Padua, Padua, Italy.

出版信息

PLoS Comput Biol. 2020 Jun 22;16(6):e1007967. doi: 10.1371/journal.pcbi.1007967. eCollection 2020 Jun.

DOI:10.1371/journal.pcbi.1007967
PMID:32569263
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7332089/
Abstract

Post-translational modification (PTM) sites have become popular for predictor development. However, with the exception of phosphorylation and a handful of other examples, PTMs suffer from a limited number of available training examples and sparsity in protein sequences. Here, proline hydroxylation is taken as an example to compare different methods and evaluate their performance on new experimentally determined sites. As a guide for effective experimental design, predictors require both high specificity and sensitivity. However, the self-reported performance may often not be indicative of prediction quality and detection of new sites is not guaranteed. We have benchmarked seven published hydroxylation site predictors on two newly constructed independent datasets. The self-reported performance is found to widely overestimate the real accuracy measured on independent datasets. No predictor performs better than random on new examples, indicating the refined models do not sufficiently generalize to detect new sites. The number of false positives is high and precision low, in particular for non-collagen proteins whose motifs are not conserved. As hydroxylation site predictors do not generalize for new data, caution is advised when using PTM predictors in the absence of independent evaluations, in particular for highly specific sites involved in signalling.

摘要

翻译后修饰(PTM)位点已成为预测器开发的热门对象。然而,除了磷酸化和少数其他例子外,PTM存在可用训练示例数量有限以及蛋白质序列稀疏的问题。在此,以脯氨酸羟化为例,比较不同方法并评估它们在新的实验确定位点上的性能。作为有效实验设计的指南,预测器需要同时具备高特异性和高灵敏度。然而,自我报告的性能往往可能无法反映预测质量,并且新位点的检测也无法得到保证。我们在两个新构建的独立数据集上对七个已发表的羟化位点预测器进行了基准测试。结果发现,自我报告的性能广泛高估了在独立数据集上测得的实际准确性。在新示例上,没有一个预测器的表现优于随机猜测,这表明优化后的模型不足以泛化以检测新位点。假阳性数量很高且精度很低,特别是对于其基序不保守的非胶原蛋白。由于羟化位点预测器无法对新数据进行泛化,因此在没有独立评估的情况下使用PTM预测器时应谨慎,特别是对于参与信号传导的高度特异性位点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/0775f6f8b074/pcbi.1007967.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/417f4d94ddb8/pcbi.1007967.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/9e75a690b493/pcbi.1007967.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/3006d7202c5d/pcbi.1007967.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/3f1da4b0c825/pcbi.1007967.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/50784da6d2d9/pcbi.1007967.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/0775f6f8b074/pcbi.1007967.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/417f4d94ddb8/pcbi.1007967.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/9e75a690b493/pcbi.1007967.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/3006d7202c5d/pcbi.1007967.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/3f1da4b0c825/pcbi.1007967.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/50784da6d2d9/pcbi.1007967.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/553c/7332089/0775f6f8b074/pcbi.1007967.g006.jpg

相似文献

1
Assessing predictors for new post translational modification sites: A case study on hydroxylation.评估新的翻译后修饰位点的预测因子:以羟基化为例的案例研究。
PLoS Comput Biol. 2020 Jun 22;16(6):e1007967. doi: 10.1371/journal.pcbi.1007967. eCollection 2020 Jun.
2
A Hybrid Deep Learning Model for Predicting Protein Hydroxylation Sites.一种用于预测蛋白质羟基化位点的混合深度学习模型。
Int J Mol Sci. 2018 Sep 18;19(9):2817. doi: 10.3390/ijms19092817.
3
New Achievements in Bioinformatics Prediction of Post Translational Modification of Proteins.蛋白质翻译后修饰的生物信息学预测新进展
Curr Top Med Chem. 2017;17(21):2381-2392. doi: 10.2174/1568026617666170328100908.
4
PTM-ssMP: A Web Server for Predicting Different Types of Post-translational Modification Sites Using Novel Site-specific Modification Profile.PTM-ssMP:一个使用新型的特异性修饰谱预测不同类型的翻译后修饰位点的网络服务器。
Int J Biol Sci. 2018 May 22;14(8):946-956. doi: 10.7150/ijbs.24121. eCollection 2018.
5
PTMscape: an open source tool to predict generic post-translational modifications and map modification crosstalk in protein domains and biological processes.PTMscape:一种用于预测通用翻译后修饰并绘制蛋白质结构域和生物过程中修饰串扰图谱的开源工具。
Mol Omics. 2018 Jun 12;14(3):197-209. doi: 10.1039/c8mo00027a.
6
PhosphOrtholog: a web-based tool for cross-species mapping of orthologous protein post-translational modifications.磷酸化直系同源物:一种用于跨物种映射直系同源蛋白质翻译后修饰的基于网络的工具。
BMC Genomics. 2015 Aug 19;16(1):617. doi: 10.1186/s12864-015-1820-x.
7
EvolStruct-Phogly: incorporating structural properties and evolutionary information from profile bigrams for the phosphoglycerylation prediction.EvolStruct-Phogly:从二联体轮廓中整合结构特性和进化信息,用于磷酸甘油化预测。
BMC Genomics. 2019 Apr 18;19(Suppl 9):984. doi: 10.1186/s12864-018-5383-5.
8
A machine learning strategy for predicting localization of post-translational modification sites in protein-protein interacting regions.一种用于预测蛋白质-蛋白质相互作用区域中翻译后修饰位点定位的机器学习策略。
BMC Bioinformatics. 2016 Aug 17;17(1):307. doi: 10.1186/s12859-016-1165-8.
9
Post-translational modifications in the context of therapeutic proteins.治疗性蛋白质背景下的翻译后修饰
Nat Biotechnol. 2006 Oct;24(10):1241-52. doi: 10.1038/nbt1252.
10
Cross-Species PTM Mapping from Phosphoproteomic Data.基于磷酸化蛋白质组学数据的跨物种蛋白质翻译后修饰图谱绘制
Methods Mol Biol. 2017;1558:459-469. doi: 10.1007/978-1-4939-6783-4_22.

引用本文的文献

1
Uncovering Enzyme-Specific Post-Translational Modifications: An Overview of Current Methods.揭示酶特异性翻译后修饰:当前方法概述
Proteomes. 2025 Aug 11;13(3):37. doi: 10.3390/proteomes13030037.
2
Machine Learning Methods for Gene Selection in Uveal Melanoma.机器学习方法在葡萄膜黑色素瘤中的基因选择。
Int J Mol Sci. 2024 Feb 1;25(3):1796. doi: 10.3390/ijms25031796.
3
Deep Learning Approaches for Detection of Breast Adenocarcinoma Causing Carcinogenic Mutations.深度学习方法在乳腺癌致癌突变检测中的应用

本文引用的文献

1
TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry datasets.TagGraph 揭示了来自大型串联质谱数据集的广泛蛋白质修饰图谱。
Nat Biotechnol. 2019 Apr;37(4):469-479. doi: 10.1038/s41587-019-0067-5. Epub 2019 Apr 1.
2
UniProt: a worldwide hub of protein knowledge.UniProt:蛋白质知识的全球枢纽。
Nucleic Acids Res. 2019 Jan 8;47(D1):D506-D515. doi: 10.1093/nar/gky1049.
3
The Pfam protein families database in 2019.2019 年 Pfam 蛋白质家族数据库。
Int J Mol Sci. 2022 Sep 29;23(19):11539. doi: 10.3390/ijms231911539.
4
In silico prediction of post-translational modifications in therapeutic antibodies.治疗性抗体中翻译后修饰的计算预测。
MAbs. 2022 Jan-Dec;14(1):2023938. doi: 10.1080/19420862.2021.2023938.
5
Current status of PTMs structural databases: applications, limitations and prospects.PTMs 结构数据库的现状:应用、局限性和前景。
Amino Acids. 2022 Apr;54(4):575-590. doi: 10.1007/s00726-021-03119-z. Epub 2022 Jan 12.
6
Development of an experiment-split method for benchmarking the generalization of a PTM site predictor: Lysine methylome as an example.开发一种实验拆分方法,用于基准化 PTM 位点预测器的泛化能力:以赖氨酸甲基组为例。
PLoS Comput Biol. 2021 Dec 8;17(12):e1009682. doi: 10.1371/journal.pcbi.1009682. eCollection 2021 Dec.
7
Comparing a Novel Malformation Syndrome Caused by Pathogenic Variants in to AUTS2 Syndrome.将由[具体基因]致病变体引起的一种新型畸形综合征与AUTS2综合征进行比较。 (你提供的原文中“in”后面缺少具体基因信息)
Front Cell Dev Biol. 2021 Nov 5;9:779009. doi: 10.3389/fcell.2021.779009. eCollection 2021.
8
A global map of associations between types of protein posttranslational modifications and human genetic diseases.蛋白质翻译后修饰类型与人类遗传疾病之间关联的全球图谱。
iScience. 2021 Jul 30;24(8):102917. doi: 10.1016/j.isci.2021.102917. eCollection 2021 Aug 20.
9
DOME: recommendations for supervised machine learning validation in biology.DOME:生物学中监督式机器学习验证的建议
Nat Methods. 2021 Oct;18(10):1122-1127. doi: 10.1038/s41592-021-01205-4.
Nucleic Acids Res. 2019 Jan 8;47(D1):D427-D432. doi: 10.1093/nar/gky995.
4
MobiDB-lite: fast and highly specific consensus prediction of intrinsic disorder in proteins.MobiDB-lite:蛋白质内在无序的快速且高度特异的一致性预测
Bioinformatics. 2017 May 1;33(9):1402-1404. doi: 10.1093/bioinformatics/btx015.
5
PHD2 Targeting Overcomes Breast Cancer Cell Death upon Glucose Starvation in a PP2A/B55α-Mediated Manner.靶向PHD2通过PP2A/B55α介导的方式克服葡萄糖饥饿诱导的乳腺癌细胞死亡。
Cell Rep. 2017 Mar 21;18(12):2836-2844. doi: 10.1016/j.celrep.2017.02.081.
6
FELLS: fast estimator of latent local structure.FELLS:潜在局部结构的快速估计器。
Bioinformatics. 2017 Jun 15;33(12):1889-1891. doi: 10.1093/bioinformatics/btx085.
7
UniProt: the universal protein knowledgebase.通用蛋白质知识库:UniProt
Nucleic Acids Res. 2017 Jan 4;45(D1):D158-D169. doi: 10.1093/nar/gkw1099. Epub 2016 Nov 29.
8
Proteomic analysis reveals diverse proline hydroxylation-mediated oxygen-sensing cellular pathways in cancer cells.蛋白质组学分析揭示了癌细胞中多种脯氨酸羟化介导的氧感应细胞途径。
Oncotarget. 2016 Nov 29;7(48):79154-79169. doi: 10.18632/oncotarget.12632.
9
pVHL suppresses kinase activity of Akt in a proline-hydroxylation-dependent manner.pVHL以脯氨酸羟化依赖的方式抑制Akt的激酶活性。
Science. 2016 Aug 26;353(6302):929-32. doi: 10.1126/science.aad5755.
10
VHLdb: A database of von Hippel-Lindau protein interactors and mutations.VHLdb:一种与希佩尔-林道蛋白相互作用因子及突变相关的数据库。
Sci Rep. 2016 Aug 11;6:31128. doi: 10.1038/srep31128.