基于串联质谱预测的翻译后修饰的计算精修。

Computational refinement of post-translational modifications predicted from tandem mass spectrometry.

机构信息

Department of Computer Science, University of Toronto, Toronto, Canada.

出版信息

Bioinformatics. 2011 Mar 15;27(6):797-806. doi: 10.1093/bioinformatics/btr017. Epub 2011 Jan 22.

DOI:10.1093/bioinformatics/btr017

PMID:21258065

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3051323/

Abstract

MOTIVATION

A post-translational modification (PTM) is a chemical modification of a protein that occurs naturally. Many of these modifications, such as phosphorylation, are known to play pivotal roles in the regulation of protein function. Henceforth, PTM perturbations have been linked to diverse diseases like Parkinson's, Alzheimer's, diabetes and cancer. To discover PTMs on a genome-wide scale, there is a recent surge of interest in analyzing tandem mass spectrometry data, and several unrestrictive (so-called 'blind') PTM search methods have been reported. However, these approaches are subject to noise in mass measurements and in the predicted modification site (amino acid position) within peptides, which can result in false PTM assignments.

RESULTS

To address these issues, we devised a machine learning algorithm, PTMClust, that can be applied to the output of blind PTM search methods to improve prediction quality, by suppressing noise in the data and clustering peptides with the same underlying modification to form PTM groups. We show that our technique outperforms two standard clustering algorithms on a simulated dataset. Additionally, we show that our algorithm significantly improves sensitivity and specificity when applied to the output of three different blind PTM search engines, SIMS, InsPecT and MODmap. Additionally, PTMClust markedly outperforms another PTM refinement algorithm, PTMFinder. We demonstrate that our technique is able to reduce false PTM assignments, improve overall detection coverage and facilitate novel PTM discovery, including terminus modifications. We applied our technique to a large-scale yeast MS/MS proteome profiling dataset and found numerous known and novel PTMs. Accurately identifying modifications in protein sequences is a critical first step for PTM profiling, and thus our approach may benefit routine proteomic analysis.

AVAILABILITY

Our algorithm is implemented in Matlab and is freely available for academic use. The software is available online from http://genes.toronto.edu.

摘要

动机

翻译后修饰 (PTM) 是蛋白质的一种化学修饰，它是自然发生的。许多这样的修饰，如磷酸化，已知在蛋白质功能的调节中起着关键作用。因此，PTM 干扰与帕金森病、阿尔茨海默病、糖尿病和癌症等多种疾病有关。为了在全基因组范围内发现 PTM，人们最近对分析串联质谱数据产生了浓厚的兴趣，并且已经报道了几种无限制（所谓的“盲目”）PTM 搜索方法。然而，这些方法受到质谱测量和肽中预测修饰位点（氨基酸位置）的噪声的影响，这可能导致错误的 PTM 分配。

结果

为了解决这些问题，我们设计了一种机器学习算法 PTMClust，它可以应用于盲目 PTM 搜索方法的输出，通过抑制数据中的噪声并将具有相同潜在修饰的肽聚类形成 PTM 组，从而提高预测质量。我们表明，我们的技术在模拟数据集上优于两种标准聚类算法。此外，当应用于三个不同的盲目 PTM 搜索引擎 SIMS、InsPecT 和 MODmap 的输出时，我们的算法显著提高了灵敏度和特异性。此外，PTMClust 明显优于另一种 PTM 精炼算法 PTMFinder。我们证明我们的技术能够减少错误的 PTM 分配，提高整体检测覆盖率并促进新的 PTM 发现，包括末端修饰。我们将我们的技术应用于大规模酵母 MS/MS 蛋白质组谱数据集，发现了许多已知和新的 PTM。准确识别蛋白质序列中的修饰是 PTM 分析的关键第一步，因此我们的方法可能有益于常规蛋白质组学分析。

可用性

我们的算法是用 Matlab 实现的，可供学术使用。该软件可从 http://genes.toronto.edu 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2f2/3051323/ad0a86ffedd4/btr017f1.jpg

相似文献

Computational refinement of post-translational modifications predicted from tandem mass spectrometry.基于串联质谱预测的翻译后修饰的计算精修。

Bioinformatics. 2011 Mar 15;27(6):797-806. doi: 10.1093/bioinformatics/btr017. Epub 2011 Jan 22.

Non-parametric Bayesian approach to post-translational modification refinement of predictions from tandem mass spectrometry.基于非参数贝叶斯方法的串联质谱预测后翻译修饰精修。

Bioinformatics. 2013 Apr 1;29(7):821-9. doi: 10.1093/bioinformatics/btt056. Epub 2013 Feb 17.

VEMS 3.0: algorithms and computational tools for tandem mass spectrometry based identification of post-translational modifications in proteins.VEMS 3.0：用于基于串联质谱法鉴定蛋白质翻译后修饰的算法和计算工具

J Proteome Res. 2005 Nov-Dec;4(6):2338-47. doi: 10.1021/pr050264q.

Sequential interval motif search: unrestricted database surveys of global MS/MS data sets for detection of putative post-translational modifications.序列间隔基序搜索：对全局串联质谱数据集进行无限制数据库搜索以检测假定的翻译后修饰。

Anal Chem. 2008 Oct 15;80(20):7846-54. doi: 10.1021/ac8009017. Epub 2008 Sep 13.

Prediction of novel modifications by unrestrictive search of tandem mass spectra.通过串联质谱的无限制搜索预测新型修饰。

J Proteome Res. 2009 Oct;8(10):4418-27. doi: 10.1021/pr9001146.

Identification of post-translational modifications by blind search of mass spectra.通过对质谱进行盲目搜索来鉴定翻译后修饰。

Nat Biotechnol. 2005 Dec;23(12):1562-7. doi: 10.1038/nbt1168. Epub 2005 Nov 27.

PTMTreeSearch: a novel two-stage tree-search algorithm with pruning rules for the identification of post-translational modification of proteins in MS/MS spectra.PTMTreeSearch：一种新颖的两阶段树搜索算法，具有修剪规则，用于鉴定 MS/MS 谱中蛋白质的翻译后修饰。

Bioinformatics. 2014 Jan 15;30(2):234-41. doi: 10.1093/bioinformatics/btt642. Epub 2013 Nov 8.

Software eyes for protein post-translational modifications.用于蛋白质翻译后修饰的软件工具

Mass Spectrom Rev. 2015 Mar-Apr;34(2):133-47. doi: 10.1002/mas.21425. Epub 2014 Jun 2.

Identification of post-translational modifications via blind search of mass-spectra.通过对质谱进行盲目搜索来鉴定翻译后修饰。

Proc IEEE Comput Syst Bioinform Conf. 2005:157-66. doi: 10.1109/csb.2005.34.

Demystifying PTM Identification Using MODplus: Best Practices and Pitfalls.使用 MODplus 揭开 PTM 鉴定的神秘面纱：最佳实践和陷阱。

Methods Mol Biol. 2024;2836:37-55. doi: 10.1007/978-1-0716-4007-4_3.

引用本文的文献

Post-translational modification prediction via prompt-based fine-tuning of a GPT-2 model.基于提示的 GPT-2 模型微调进行翻译后修饰预测。

Nat Commun. 2024 Aug 7;15(1):6699. doi: 10.1038/s41467-024-51071-9.

PTMiner: Localization and Quality Control of Protein Modifications Detected in an Open Search and Its Application to Comprehensive Post-translational Modification Characterization in Human Proteome.PTMiner：在开放搜索中检测到的蛋白质修饰的定位和质量控制及其在人类蛋白质组中全面翻译后修饰特征中的应用。

Mol Cell Proteomics. 2019 Feb;18(2):391-405. doi: 10.1074/mcp.RA118.000812. Epub 2018 Nov 12.

CarSPred: a computational tool for predicting carbonylation sites of human proteins.CarSPred：一种预测人类蛋白质羰基化位点的计算工具。

PLoS One. 2014 Oct 27;9(10):e111478. doi: 10.1371/journal.pone.0111478. eCollection 2014.

Structure predictions of two Bauhinia variegata lectins reveal patterns of C-terminal properties in single chain legume lectins.两种羊蹄甲属植物凝集素的结构预测揭示了单链豆科植物凝集素 C 末端特性的模式。

PLoS One. 2013 Nov 19;8(11):e81338. doi: 10.1371/journal.pone.0081338. eCollection 2013.

Position-specific analysis and prediction for protein lysine acetylation based on multiple features.基于多种特征的蛋白质赖氨酸乙酰化的位置特异性分析和预测。

PLoS One. 2012;7(11):e49108. doi: 10.1371/journal.pone.0049108. Epub 2012 Nov 16.

本文引用的文献

An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.一种将肽的串联质谱数据与蛋白质数据库中氨基酸序列相关联的方法。

J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.

A novel approach for untargeted post-translational modification identification using integer linear optimization and tandem mass spectrometry.一种利用整数线性优化和串联质谱进行非靶向翻译后修饰鉴定的新方法。

Mol Cell Proteomics. 2010 May;9(5):764-79. doi: 10.1074/mcp.M900487-MCP200. Epub 2010 Jan 26.

Prediction of novel modifications by unrestrictive search of tandem mass spectra.通过串联质谱的无限制搜索预测新型修饰。

J Proteome Res. 2009 Oct;8(10):4418-27. doi: 10.1021/pr9001146.

PTMap--a sequence alignment software for unrestricted, accurate, and full-spectrum identification of post-translational modification sites.PTMap——一款用于无限制、准确且全谱识别翻译后修饰位点的序列比对软件。

Proc Natl Acad Sci U S A. 2009 Jan 20;106(3):761-6. doi: 10.1073/pnas.0811739106. Epub 2009 Jan 9.

Anal Chem. 2008 Oct 15;80(20):7846-54. doi: 10.1021/ac8009017. Epub 2008 Sep 13.

SeMoP: a new computational strategy for the unrestricted search for modified peptides using LC-MS/MS data.SeMoP：一种利用液相色谱-串联质谱数据无限制搜索修饰肽段的新计算策略。

J Proteome Res. 2008 Sep;7(9):4199-208. doi: 10.1021/pr800277y. Epub 2008 Aug 8.

Comprehensive comparison of collision induced dissociation and electron transfer dissociation.碰撞诱导解离与电子转移解离的综合比较

Anal Chem. 2008 Jul 1;80(13):4825-35. doi: 10.1021/ac8007785. Epub 2008 Jun 10.

Assigning significance to peptides identified by tandem mass spectrometry using decoy databases.使用诱饵数据库对通过串联质谱鉴定的肽段赋予显著性。

J Proteome Res. 2008 Jan;7(1):29-34. doi: 10.1021/pr700600n. Epub 2007 Dec 8.

Accurate annotation of peptide modifications through unrestrictive database search.通过无限制数据库搜索对肽修饰进行准确注释。

J Proteome Res. 2008 Jan;7(1):170-81. doi: 10.1021/pr070444v. Epub 2007 Nov 23.

Mapping protein post-translational modifications with mass spectrometry.利用质谱法绘制蛋白质翻译后修饰图谱。

Nat Methods. 2007 Oct;4(10):798-806. doi: 10.1038/nmeth1100.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于串联质谱预测的翻译后修饰的计算精修。

Computational refinement of post-translational modifications predicted from tandem mass spectrometry.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献