MoMo：具有统计学意义的翻译后修饰基序的发现。

MoMo: discovery of statistically significant post-translational modification motifs.

作者信息

Cheng Alice, Grant Charles E, Noble William S, Bailey Timothy L

机构信息

Department of Genome Sciences, University of Washington, Seattle, WA, USA.

Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA.

出版信息

Bioinformatics. 2019 Aug 15;35(16):2774-2782. doi: 10.1093/bioinformatics/bty1058.

DOI:10.1093/bioinformatics/bty1058

PMID:30596994

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6691336/

Abstract

MOTIVATION

Post-translational modifications (PTMs) of proteins are associated with many significant biological functions and can be identified in high throughput using tandem mass spectrometry. Many PTMs are associated with short sequence patterns called 'motifs' that help localize the modifying enzyme. Accordingly, many algorithms have been designed to identify these motifs from mass spectrometry data. Accurate statistical confidence estimates for discovered motifs are critically important for proper interpretation and in the design of downstream experimental validation.

RESULTS

We describe a method for assigning statistical confidence estimates to PTM motifs, and we demonstrate that this method provides accurate P-values on both simulated and real data. Our methods are implemented in MoMo, a software tool for discovering motifs among sets of PTMs that we make available as a web server and as downloadable source code. MoMo re-implements the two most widely used PTM motif discovery algorithms-motif-x and MoDL-while offering many enhancements. Relative to motif-x, MoMo offers improved statistical confidence estimates and more accurate calculation of motif scores. The MoMo web server offers more proteome databases, more input formats, larger inputs and longer running times than the motif-x web server. Finally, our study demonstrates that the confidence estimates produced by motif-x are inaccurate. This inaccuracy stems in part from the common practice of drawing 'background' peptides from an unshuffled proteome database. Our results thus suggest that many of the papers that use motif-x to find motifs may be reporting results that lack statistical support.

AVAILABILITY AND IMPLEMENTATION

The MoMo web server and source code are provided at http://meme-suite.org.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

蛋白质的翻译后修饰（PTM）与许多重要的生物学功能相关，并且可以使用串联质谱进行高通量鉴定。许多PTM与称为“基序”的短序列模式相关，这些基序有助于定位修饰酶。因此，已经设计了许多算法来从质谱数据中识别这些基序。对发现的基序进行准确的统计置信度估计对于正确解释和下游实验验证的设计至关重要。

结果

我们描述了一种为PTM基序分配统计置信度估计的方法，并证明该方法在模拟数据和真实数据上都能提供准确的P值。我们的方法在MoMo中实现，MoMo是一种用于在PTM集合中发现基序的软件工具，我们将其作为网络服务器和可下载的源代码提供。MoMo重新实现了两种最广泛使用的PTM基序发现算法——Motif-X和MoDL，同时提供了许多增强功能。相对于Motif-X，MoMo提供了改进的统计置信度估计和更准确的基序分数计算。MoMo网络服务器比Motif-X网络服务器提供更多的蛋白质组数据库、更多的输入格式、更大的输入和更长的运行时间。最后，我们的研究表明Motif-X产生的置信度估计不准确。这种不准确部分源于从未洗牌的蛋白质组数据库中提取“背景”肽的常见做法。因此，我们的结果表明，许多使用Motif-X来寻找基序的论文可能报告的结果缺乏统计支持。

可用性和实现方式

MoMo网络服务器和源代码可在http://meme-suite.org获得。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

MoMo: discovery of statistically significant post-translational modification motifs.MoMo：具有统计学意义的翻译后修饰基序的发现。

Bioinformatics. 2019 Aug 15;35(16):2774-2782. doi: 10.1093/bioinformatics/bty1058.

STREME: accurate and versatile sequence motif discovery.STREME：准确且通用的序列基序发现。

Bioinformatics. 2021 Sep 29;37(18):2834-2840. doi: 10.1093/bioinformatics/btab203.

Non-parametric Bayesian approach to post-translational modification refinement of predictions from tandem mass spectrometry.基于非参数贝叶斯方法的串联质谱预测后翻译修饰精修。

Bioinformatics. 2013 Apr 1;29(7):821-9. doi: 10.1093/bioinformatics/btt056. Epub 2013 Feb 17.

MEME SUITE: tools for motif discovery and searching.MEME套件：用于基序发现和搜索的工具。

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W202-8. doi: 10.1093/nar/gkp335. Epub 2009 May 20.

PTMTreeSearch: a novel two-stage tree-search algorithm with pruning rules for the identification of post-translational modification of proteins in MS/MS spectra.PTMTreeSearch：一种新颖的两阶段树搜索算法，具有修剪规则，用于鉴定 MS/MS 谱中蛋白质的翻译后修饰。

Bioinformatics. 2014 Jan 15;30(2):234-41. doi: 10.1093/bioinformatics/btt642. Epub 2013 Nov 8.

Biological sequence motif discovery using motif-x.使用Motif-X进行生物序列基序发现。

Curr Protoc Bioinformatics. 2011 Sep;Chapter 13:13.15.1-13.15.24. doi: 10.1002/0471250953.bi1315s35.

Computational refinement of post-translational modifications predicted from tandem mass spectrometry.基于串联质谱预测的翻译后修饰的计算精修。

Bioinformatics. 2011 Mar 15;27(6):797-806. doi: 10.1093/bioinformatics/btr017. Epub 2011 Jan 22.

MMFPh: a maximal motif finder for phosphoproteomics datasets.MMFPh：用于磷酸化蛋白质组学数据集的最大基序发现器。

Bioinformatics. 2012 Jun 15;28(12):1562-70. doi: 10.1093/bioinformatics/bts195. Epub 2012 Apr 23.

Systematic characterization and prediction of post-translational modification cross-talk between proteins.系统地描述和预测蛋白质之间的翻译后修饰交叉对话。

Bioinformatics. 2019 Aug 1;35(15):2626-2633. doi: 10.1093/bioinformatics/bty1033.

MEME: discovering and analyzing DNA and protein sequence motifs.MEME：发现和分析DNA与蛋白质序列基序

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W369-73. doi: 10.1093/nar/gkl198.

引用本文的文献

Comprehensive Ubiquitome Analysis of Leaves Infected with Tomato Brown Rugose Fruit Virus.感染番茄褐色皱纹果病毒的叶片的全面泛素组分析

Biology (Basel). 2025 Jun 5;14(6):656. doi: 10.3390/biology14060656.

A Systematic Study of Lysine Succinylation in the Pathogenic Bacterium in Aquatic Animals.水生动物致病细菌中赖氨酸琥珀酰化的系统研究。

Molecules. 2025 May 31;30(11):2418. doi: 10.3390/molecules30112418.

Targeting PLK1-CBX8-GPX4 axis overcomes BRAF/EGFR inhibitor resistance in BRAFV600E colorectal cancer via ferroptosis.靶向PLK1-CBX8-GPX4轴通过铁死亡克服BRAFV600E结直肠癌中的BRAF/EGFR抑制剂耐药性。

Nat Commun. 2025 Apr 16;16(1):3605. doi: 10.1038/s41467-025-58992-z.

Tauopathy after long-term cervical lymphadenectomy.长期颈部淋巴结清扫术后的tau蛋白病

Alzheimers Dement. 2025 Apr;21(4):e70136. doi: 10.1002/alz.70136.

Pupylation-Based Proximity Labeling Unravels a Comprehensive Protein and Phosphoprotein Interactome of the Arabidopsis TOR Complex.基于 pupylation 的邻近标记揭示了拟南芥 TOR 复合体完整的蛋白质和磷酸化蛋白质相互作用组。

Adv Sci (Weinh). 2025 May;12(19):e2414496. doi: 10.1002/advs.202414496. Epub 2025 Mar 24.

Spatiotemporal single-cell architecture of gene expression in the Caenorhabditis elegans germ cells.秀丽隐杆线虫生殖细胞中基因表达的时空单细胞结构

Cell Discov. 2025 Mar 18;11(1):26. doi: 10.1038/s41421-025-00790-4.

Dynamic proteome and acetylome profiling reveals key regulators of sucrose accumulation in sugarcane.动态蛋白质组和乙酰化蛋白质组分析揭示甘蔗中蔗糖积累的关键调控因子。

Plant Cell Rep. 2025 Mar 17;44(4):74. doi: 10.1007/s00299-025-03449-2.

Quantitative Proteomic Analysis of Lysine Malonylation in Response to Salicylic Acid in the Roots of .水杨酸处理下[植物名称]根系中赖氨酸丙二酰化的定量蛋白质组学分析（原文中植物名称缺失）

Int J Mol Sci. 2025 Feb 6;26(3):1392. doi: 10.3390/ijms26031392.

Dynamic global acetylation remodeling during the yeast heat shock response.酵母热休克反应过程中的动态全局乙酰化重塑

bioRxiv. 2025 Jan 10:2025.01.10.632339. doi: 10.1101/2025.01.10.632339.

Gas-phase fractionation DDA promotes in-depth DIA phosphoproteome analysis.气相分级DDA促进深度DIA磷酸化蛋白质组分析。

Heliyon. 2025 Jan 14;11(2):e41928. doi: 10.1016/j.heliyon.2025.e41928. eCollection 2025 Jan 30.

本文引用的文献

Characterization of Plasmodium falciparum Atypical Kinase PfPK7 Dependent Phosphoproteome.恶性疟原虫非典型激酶 PfPK7 依赖性磷酸化蛋白质组的特征分析。

J Proteome Res. 2018 Jun 1;17(6):2112-2123. doi: 10.1021/acs.jproteome.8b00062. Epub 2018 Apr 30.

The eukaryotic linear motif resource - 2018 update.真核线性基序资源 - 2018 更新版。

Nucleic Acids Res. 2018 Jan 4;46(D1):D428-D434. doi: 10.1093/nar/gkx1077.

Anatomy and evolution of database search engines-a central component of mass spectrometry based proteomic workflows.数据库搜索引擎的解剖结构和演化——基于质谱的蛋白质组学工作流程的核心组件。

Mass Spectrom Rev. 2020 May;39(3):292-306. doi: 10.1002/mas.21543. Epub 2017 Sep 13.

Uncovering Phosphorylation-Based Specificities through Functional Interaction Networks.通过功能相互作用网络揭示基于磷酸化的特异性

Mol Cell Proteomics. 2016 Jan;15(1):236-45. doi: 10.1074/mcp.M115.052357. Epub 2015 Nov 16.

Mining Conditional Phosphorylation Motifs.挖掘条件性磷酸化基序

IEEE/ACM Trans Comput Biol Bioinform. 2014 Sep-Oct;11(5):915-27. doi: 10.1109/TCBB.2014.2321400.

MMFPh: a maximal motif finder for phosphoproteomics datasets.MMFPh：用于磷酸化蛋白质组学数据集的最大基序发现器。

Bioinformatics. 2012 Jun 15;28(12):1562-70. doi: 10.1093/bioinformatics/bts195. Epub 2012 Apr 23.

PhosphoSitePlus: a comprehensive resource for investigating the structure and function of experimentally determined post-translational modifications in man and mouse.磷酸化位点数据库：一个综合性资源，用于研究人和鼠中实验确定的翻译后修饰的结构和功能。

Nucleic Acids Res. 2012 Jan;40(Database issue):D261-70. doi: 10.1093/nar/gkr1122. Epub 2011 Dec 1.

Biological sequence motif discovery using motif-x.使用Motif-X进行生物序列基序发现。

Curr Protoc Bioinformatics. 2011 Sep;Chapter 13:13.15.1-13.15.24. doi: 10.1002/0471250953.bi1315s35.

A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.一种基于蛋白质组的、定量的体内泛素化位点调查揭示了广泛的调节作用。

Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. doi: 10.1074/mcp.M111.013284. Epub 2011 Sep 1.

Discovery of protein phosphorylation motifs through exploratory data analysis.通过探索性数据分析发现蛋白质磷酸化基序。

PLoS One. 2011;6(5):e20025. doi: 10.1371/journal.pone.0020025. Epub 2011 May 25.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验