Suppr超能文献

MoMo:具有统计学意义的翻译后修饰基序的发现。

MoMo: discovery of statistically significant post-translational modification motifs.

作者信息

Cheng Alice, Grant Charles E, Noble William S, Bailey Timothy L

机构信息

Department of Genome Sciences, University of Washington, Seattle, WA, USA.

Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA.

出版信息

Bioinformatics. 2019 Aug 15;35(16):2774-2782. doi: 10.1093/bioinformatics/bty1058.

Abstract

MOTIVATION

Post-translational modifications (PTMs) of proteins are associated with many significant biological functions and can be identified in high throughput using tandem mass spectrometry. Many PTMs are associated with short sequence patterns called 'motifs' that help localize the modifying enzyme. Accordingly, many algorithms have been designed to identify these motifs from mass spectrometry data. Accurate statistical confidence estimates for discovered motifs are critically important for proper interpretation and in the design of downstream experimental validation.

RESULTS

We describe a method for assigning statistical confidence estimates to PTM motifs, and we demonstrate that this method provides accurate P-values on both simulated and real data. Our methods are implemented in MoMo, a software tool for discovering motifs among sets of PTMs that we make available as a web server and as downloadable source code. MoMo re-implements the two most widely used PTM motif discovery algorithms-motif-x and MoDL-while offering many enhancements. Relative to motif-x, MoMo offers improved statistical confidence estimates and more accurate calculation of motif scores. The MoMo web server offers more proteome databases, more input formats, larger inputs and longer running times than the motif-x web server. Finally, our study demonstrates that the confidence estimates produced by motif-x are inaccurate. This inaccuracy stems in part from the common practice of drawing 'background' peptides from an unshuffled proteome database. Our results thus suggest that many of the papers that use motif-x to find motifs may be reporting results that lack statistical support.

AVAILABILITY AND IMPLEMENTATION

The MoMo web server and source code are provided at http://meme-suite.org.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

蛋白质的翻译后修饰(PTM)与许多重要的生物学功能相关,并且可以使用串联质谱进行高通量鉴定。许多PTM与称为“基序”的短序列模式相关,这些基序有助于定位修饰酶。因此,已经设计了许多算法来从质谱数据中识别这些基序。对发现的基序进行准确的统计置信度估计对于正确解释和下游实验验证的设计至关重要。

结果

我们描述了一种为PTM基序分配统计置信度估计的方法,并证明该方法在模拟数据和真实数据上都能提供准确的P值。我们的方法在MoMo中实现,MoMo是一种用于在PTM集合中发现基序的软件工具,我们将其作为网络服务器和可下载的源代码提供。MoMo重新实现了两种最广泛使用的PTM基序发现算法——Motif-X和MoDL,同时提供了许多增强功能。相对于Motif-X,MoMo提供了改进的统计置信度估计和更准确的基序分数计算。MoMo网络服务器比Motif-X网络服务器提供更多的蛋白质组数据库、更多的输入格式、更大的输入和更长的运行时间。最后,我们的研究表明Motif-X产生的置信度估计不准确。这种不准确部分源于从未洗牌的蛋白质组数据库中提取“背景”肽的常见做法。因此,我们的结果表明,许多使用Motif-X来寻找基序的论文可能报告的结果缺乏统计支持。

可用性和实现方式

MoMo网络服务器和源代码可在http://meme-suite.org获得。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

2
STREME: accurate and versatile sequence motif discovery.STREME:准确且通用的序列基序发现。
Bioinformatics. 2021 Sep 29;37(18):2834-2840. doi: 10.1093/bioinformatics/btab203.
4
MEME SUITE: tools for motif discovery and searching.MEME套件:用于基序发现和搜索的工具。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W202-8. doi: 10.1093/nar/gkp335. Epub 2009 May 20.
6
Biological sequence motif discovery using motif-x.使用Motif-X进行生物序列基序发现。
Curr Protoc Bioinformatics. 2011 Sep;Chapter 13:13.15.1-13.15.24. doi: 10.1002/0471250953.bi1315s35.
8
MMFPh: a maximal motif finder for phosphoproteomics datasets.MMFPh:用于磷酸化蛋白质组学数据集的最大基序发现器。
Bioinformatics. 2012 Jun 15;28(12):1562-70. doi: 10.1093/bioinformatics/bts195. Epub 2012 Apr 23.
10
MEME: discovering and analyzing DNA and protein sequence motifs.MEME:发现和分析DNA与蛋白质序列基序
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W369-73. doi: 10.1093/nar/gkl198.

引用本文的文献

4
10
Gas-phase fractionation DDA promotes in-depth DIA phosphoproteome analysis.气相分级DDA促进深度DIA磷酸化蛋白质组分析。
Heliyon. 2025 Jan 14;11(2):e41928. doi: 10.1016/j.heliyon.2025.e41928. eCollection 2025 Jan 30.

本文引用的文献

2
The eukaryotic linear motif resource - 2018 update.真核线性基序资源 - 2018 更新版。
Nucleic Acids Res. 2018 Jan 4;46(D1):D428-D434. doi: 10.1093/nar/gkx1077.
5
Mining Conditional Phosphorylation Motifs.挖掘条件性磷酸化基序
IEEE/ACM Trans Comput Biol Bioinform. 2014 Sep-Oct;11(5):915-27. doi: 10.1109/TCBB.2014.2321400.
6
MMFPh: a maximal motif finder for phosphoproteomics datasets.MMFPh:用于磷酸化蛋白质组学数据集的最大基序发现器。
Bioinformatics. 2012 Jun 15;28(12):1562-70. doi: 10.1093/bioinformatics/bts195. Epub 2012 Apr 23.
8
Biological sequence motif discovery using motif-x.使用Motif-X进行生物序列基序发现。
Curr Protoc Bioinformatics. 2011 Sep;Chapter 13:13.15.1-13.15.24. doi: 10.1002/0471250953.bi1315s35.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验