• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Motif-All:发现所有磷酸化基序。

Motif-All: discovering all phosphorylation motifs.

机构信息

School of Software, Dalian University of Technology, Dalian, China.

出版信息

BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S22. doi: 10.1186/1471-2105-12-S1-S22.

DOI:10.1186/1471-2105-12-S1-S22
PMID:21342552
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3044277/
Abstract

BACKGROUND

Phosphorylation motifs represent common patterns around the phosphorylation site. The discovery of such kinds of motifs reveals the underlying regulation mechanism and facilitates the prediction of unknown phosphorylation event. To date, people have gathered large amounts of phosphorylation data, making it possible to perform substrate-driven motif discovery using data mining techniques.

RESULTS

We describe an algorithm called Motif-All that is able to efficiently identify all statistically significant motifs. The proposed method explores a support constraint to reduce search space and avoid generating random artifacts. As the number of phosphorylated peptides are far less than that of unphosphorylated ones, we divide the mining process into two stages: The first step generates candidates from the set of phosphorylated sequences using only support constraint and the second step tests the statistical significance of each candidate using the odds ratio derived from the whole data set. Experimental results on real data show that Motif-All outperforms current algorithms in terms of both effectiveness and efficiency.

CONCLUSIONS

Motif-All is a useful tool for discovering statistically significant phosphorylation motifs. Source codes and data sets are available at: http://bioinformatics.ust.hk/MotifAll.rar.

摘要

背景

磷酸化模体是围绕磷酸化位点的常见模式。这类模体的发现揭示了潜在的调控机制,并有助于预测未知的磷酸化事件。迄今为止,人们已经收集了大量的磷酸化数据,使得使用数据挖掘技术进行基于底物的模体发现成为可能。

结果

我们描述了一种名为 Motif-All 的算法,它能够有效地识别所有具有统计学意义的模体。该方法通过探索支持约束来减少搜索空间,避免产生随机伪影。由于磷酸化肽的数量远远少于非磷酸化肽的数量,我们将挖掘过程分为两个阶段:第一步仅使用支持约束从磷酸化序列集中生成候选序列,第二步使用从整个数据集推导的比值比测试每个候选序列的统计学意义。在真实数据上的实验结果表明,Motif-All 在有效性和效率方面均优于当前的算法。

结论

Motif-All 是一种用于发现具有统计学意义的磷酸化模体的有用工具。源代码和数据集可在以下网址获取:http://bioinformatics.ust.hk/MotifAll.rar。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/111f/3044277/3d7f13894f0c/1471-2105-12-S1-S22-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/111f/3044277/fed39124a204/1471-2105-12-S1-S22-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/111f/3044277/82ba7738c624/1471-2105-12-S1-S22-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/111f/3044277/4d09cc00487c/1471-2105-12-S1-S22-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/111f/3044277/7135f03a4b2c/1471-2105-12-S1-S22-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/111f/3044277/3d7f13894f0c/1471-2105-12-S1-S22-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/111f/3044277/fed39124a204/1471-2105-12-S1-S22-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/111f/3044277/82ba7738c624/1471-2105-12-S1-S22-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/111f/3044277/4d09cc00487c/1471-2105-12-S1-S22-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/111f/3044277/7135f03a4b2c/1471-2105-12-S1-S22-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/111f/3044277/3d7f13894f0c/1471-2105-12-S1-S22-5.jpg

相似文献

1
Motif-All: discovering all phosphorylation motifs.Motif-All:发现所有磷酸化基序。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S22. doi: 10.1186/1471-2105-12-S1-S22.
2
Mining Conditional Phosphorylation Motifs.挖掘条件性磷酸化基序
IEEE/ACM Trans Comput Biol Bioinform. 2014 Sep-Oct;11(5):915-27. doi: 10.1109/TCBB.2014.2321400.
3
WildSpan: mining structured motifs from protein sequences.WildSpan:从蛋白质序列中挖掘结构化基序
Algorithms Mol Biol. 2011 Mar 31;6(1):6. doi: 10.1186/1748-7188-6-6.
4
IncMD: incremental trie-based structural motif discovery algorithm.IncMD:基于增量前缀树的结构基序发现算法。
J Bioinform Comput Biol. 2014 Oct;12(5):1450027. doi: 10.1142/S0219720014500279.
5
Discovery of protein phosphorylation motifs through exploratory data analysis.通过探索性数据分析发现蛋白质磷酸化基序。
PLoS One. 2011;6(5):e20025. doi: 10.1371/journal.pone.0020025. Epub 2011 May 25.
6
A Monte Carlo-based framework enhances the discovery and interpretation of regulatory sequence motifs.基于蒙特卡罗的框架增强了调控序列基序的发现和解释。
BMC Bioinformatics. 2012 Nov 27;13:317. doi: 10.1186/1471-2105-13-317.
7
Discovering sequence motifs.发现序列基序。
Methods Mol Biol. 2008;452:231-51. doi: 10.1007/978-1-60327-159-2_12.
8
A novel motif discovery algorithm for identifying protein families.一种用于识别蛋白质家族的新型基序发现算法。
Protein Pept Lett. 2010 Oct;17(10):1215-22. doi: 10.2174/092986610792231429.
9
Discovery of phosphorylation motif mixtures in phosphoproteomics data.在磷酸化蛋白质组学数据中发现磷酸化基序混合物。
Bioinformatics. 2009 Jan 1;25(1):14-21. doi: 10.1093/bioinformatics/btn569. Epub 2008 Nov 7.
10
EMS3: An Improved Algorithm for Finding Edit-Distance Based Motifs.EMS3:一种用于寻找基于编辑距离的基序的改进算法。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):27-37. doi: 10.1109/TCBB.2020.3024222. Epub 2021 Feb 3.

引用本文的文献

1
GraphPhos: Predict Protein-Phosphorylation Sites Based on Graph Neural Networks.GraphPhos:基于图神经网络预测蛋白质磷酸化位点
Int J Mol Sci. 2025 Jan 23;26(3):941. doi: 10.3390/ijms26030941.
2
PTMoreR-enabled cross-species PTM mapping and comparative phosphoproteomics across mammals.PTMoreR 增强的跨物种 PTM 映射和哺乳动物的比较磷酸蛋白质组学。
Cell Rep Methods. 2024 Sep 16;4(9):100859. doi: 10.1016/j.crmeth.2024.100859. Epub 2024 Sep 9.
3
A Review of Machine Learning and Algorithmic Methods for Protein Phosphorylation Site Prediction.

本文引用的文献

1
PostMod: sequence based prediction of kinase-specific phosphorylation sites with indirect relationship.PostMod:具有间接关系的激酶特异性磷酸化位点的基于序列的预测。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S10. doi: 10.1186/1471-2105-11-S1-S10.
2
PhosPhAt: the Arabidopsis thaliana phosphorylation site database. An update.PhosPhAt:拟南芥磷酸化位点数据库。更新。
Nucleic Acids Res. 2010 Jan;38(Database issue):D828-34. doi: 10.1093/nar/gkp810. Epub 2009 Oct 30.
3
Discovery of phosphorylation motif mixtures in phosphoproteomics data.
机器学习和算法方法在蛋白质磷酸化位点预测中的研究进展综述
Genomics Proteomics Bioinformatics. 2023 Dec;21(6):1266-1285. doi: 10.1016/j.gpb.2023.03.007. Epub 2023 Oct 19.
4
Proteomics and Phospho-Proteomics Profiling of the Co-Formulation of Type I and II Interferons, HeberFERON, in the Glioblastoma-Derived Cell Line U-87 MG.I 型和 II 型干扰素共制剂 HeberFERON 在胶质母细胞瘤衍生细胞系 U-87 MG 中的蛋白质组学和磷酸化蛋白质组学分析。
Cells. 2022 Dec 15;11(24):4068. doi: 10.3390/cells11244068.
5
TcdB of Mediates RAS-Dependent Necrosis in Epithelial Cells.Mediates RAS-Dependent Necrosis in Epithelial Cells.
Int J Mol Sci. 2022 Apr 12;23(8):4258. doi: 10.3390/ijms23084258.
6
Isotopically Dimethyl Labeling-Based Quantitative Proteomic Analysis of Phosphoproteomes of Soybean Cultivars.基于同位素二甲基标记的大豆品种磷酸蛋白质组定量蛋白质组学分析。
Biomolecules. 2021 Aug 16;11(8):1218. doi: 10.3390/biom11081218.
7
Phosphorylation Site Motifs in Plant Protein Kinases and Their Substrates.植物蛋白激酶及其底物中的磷酸化位点基序
Methods Mol Biol. 2021;2358:1-16. doi: 10.1007/978-1-0716-1625-3_1.
8
3D Functional Genomics Screens Identify CREBBP as a Targetable Driver in Aggressive Triple-Negative Breast Cancer.3D功能基因组学筛选确定CREBBP是侵袭性三阴性乳腺癌的一个可靶向驱动因子。
Cancer Res. 2021 Feb 15;81(4):847-859. doi: 10.1158/0008-5472.CAN-20-1822. Epub 2021 Jan 28.
9
Characterization of Kinase Expression Related to Increased Migration of PC-3M Cells Using Global Comparative Phosphoproteome Analysis.采用全比较磷酸化蛋白质组分析鉴定与 PC-3M 细胞迁移增加相关的激酶表达特征。
Cancer Genomics Proteomics. 2020 Sep-Oct;17(5):543-553. doi: 10.21873/cgp.20210.
10
Molecular dissection of Chagas induced cardiomyopathy reveals central disease associated and druggable signaling pathways.解析恰加斯病引发的心肌病的分子机制揭示了与疾病相关的核心信号通路和有药物治疗潜力的靶点。
PLoS Negl Trop Dis. 2020 May 20;14(5):e0007980. doi: 10.1371/journal.pntd.0007980. eCollection 2020 May.
在磷酸化蛋白质组学数据中发现磷酸化基序混合物。
Bioinformatics. 2009 Jan 1;25(1):14-21. doi: 10.1093/bioinformatics/btn569. Epub 2008 Nov 7.
4
Predicting protein post-translational modifications using meta-analysis of proteome scale data sets.利用蛋白质组规模数据集的荟萃分析预测蛋白质翻译后修饰
Mol Cell Proteomics. 2009 Feb;8(2):365-79. doi: 10.1074/mcp.M800332-MCP200. Epub 2008 Oct 28.
5
PhosPhAt: a database of phosphorylation sites in Arabidopsis thaliana and a plant-specific phosphorylation site predictor.PhosPhAt:拟南芥磷酸化位点数据库及植物特异性磷酸化位点预测工具
Nucleic Acids Res. 2008 Jan;36(Database issue):D1015-21. doi: 10.1093/nar/gkm812. Epub 2007 Nov 4.
6
A curated compendium of phosphorylation motifs.一个经过整理的磷酸化基序汇编。
Nat Biotechnol. 2007 Mar;25(3):285-6. doi: 10.1038/nbt0307-285.
7
PPSP: prediction of PK-specific phosphorylation site with Bayesian decision theory.PPSP:基于贝叶斯决策理论的PK特异性磷酸化位点预测
BMC Bioinformatics. 2006 Mar 20;7:163. doi: 10.1186/1471-2105-7-163.
8
An iterative statistical approach to the identification of protein phosphorylation motifs from large-scale data sets.一种从大规模数据集中识别蛋白质磷酸化基序的迭代统计方法。
Nat Biotechnol. 2005 Nov;23(11):1391-8. doi: 10.1038/nbt1146.
9
Combinatorial pattern discovery in biological sequences: The TEIRESIAS algorithm.生物序列中的组合模式发现:TEIRESIAS算法。
Bioinformatics. 1998;14(1):55-67. doi: 10.1093/bioinformatics/14.1.55.