真核生物中蛋白质调控基序的大规模发现与描述。

Large-scale discovery and characterization of protein regulatory motifs in eukaryotes.

机构信息

Department of Molecular Biology, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America.

出版信息

PLoS One. 2010 Dec 29;5(12):e14444. doi: 10.1371/journal.pone.0014444.

DOI:10.1371/journal.pone.0014444

PMID:21206902

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3012054/

Abstract

The increasing ability to generate large-scale, quantitative proteomic data has brought with it the challenge of analyzing such data to discover the sequence elements that underlie systems-level protein behavior. Here we show that short, linear protein motifs can be efficiently recovered from proteome-scale datasets such as sub-cellular localization, molecular function, half-life, and protein abundance data using an information theoretic approach. Using this approach, we have identified many known protein motifs, such as phosphorylation sites and localization signals, and discovered a large number of candidate elements. We estimate that ~80% of these are novel predictions in that they do not match a known motif in both sequence and biological context, suggesting that post-translational regulation of protein behavior is still largely unexplored. These predicted motifs, many of which display preferential association with specific biological pathways and non-random positioning in the linear protein sequence, provide focused hypotheses for experimental validation.

摘要

大规模、定量蛋白质组学数据的生成能力不断提高，这带来了分析这些数据以发现系统水平蛋白质行为的序列元素的挑战。在这里，我们展示了使用信息论方法可以从细胞内定位、分子功能、半衰期和蛋白质丰度等蛋白质组学规模数据集高效地回收短线性蛋白质基序。使用这种方法，我们已经鉴定了许多已知的蛋白质基序，如磷酸化位点和定位信号，并发现了大量候选元素。我们估计这些元素中有~80%是新的预测，因为它们在序列和生物学背景上都与已知的基序不匹配，这表明蛋白质行为的翻译后调控仍在很大程度上未被探索。这些预测的基序，其中许多与特定的生物途径具有优先相关性，并且在线性蛋白质序列中的位置是非随机的，为实验验证提供了有针对性的假说。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/098a/3012054/b6a9ec80bf23/pone.0014444.g001.jpg

相似文献

Large-scale discovery and characterization of protein regulatory motifs in eukaryotes.

PLoS One. 2010 Dec 29;5(12):e14444. doi: 10.1371/journal.pone.0014444.

Computational identification and analysis of protein short linear motifs.

Front Biosci (Landmark Ed). 2010 Jun 1;15(3):801-25. doi: 10.2741/3647.

Predicting protein post-translational modifications using meta-analysis of proteome scale data sets.

Mol Cell Proteomics. 2009 Feb;8(2):365-79. doi: 10.1074/mcp.M800332-MCP200. Epub 2008 Oct 28.

Identification of motifs in protein sequences.

Curr Protoc Cell Biol. 2001 May;Appendix 1:Appendix 1C. doi: 10.1002/0471143030.cba01cs00.

AVID: an integrative framework for discovering functional relationships among proteins.

BMC Bioinformatics. 2005 Jun 1;6:136. doi: 10.1186/1471-2105-6-136.

Robust unsupervised deconvolution of linear motifs characterizes 68 protein modifications at proteome scale.

Sci Rep. 2021 Nov 18;11(1):22490. doi: 10.1038/s41598-021-01971-3.

C-terminal motif prediction in eukaryotic proteomes using comparative genomics and statistical over-representation across protein families.

BMC Genomics. 2007 Jun 26;8:191. doi: 10.1186/1471-2164-8-191.

Binding site prediction for protein-protein interactions and novel motif discovery using re-occurring polypeptide sequences.

BMC Bioinformatics. 2011 Jun 2;12:225. doi: 10.1186/1471-2105-12-225.

ELM: the status of the 2010 eukaryotic linear motif resource.

Nucleic Acids Res. 2010 Jan;38(Database issue):D167-80. doi: 10.1093/nar/gkp1016. Epub 2009 Nov 17.

SLiMSearch: a framework for proteome-wide discovery and annotation of functional modules in intrinsically disordered regions.

Nucleic Acids Res. 2017 Jul 3;45(W1):W464-W469. doi: 10.1093/nar/gkx238.

引用本文的文献

How different viruses perturb host cellular machinery via short linear motifs.

EXCLI J. 2023 Oct 26;22:1113-1128. doi: 10.17179/excli2023-6328. eCollection 2023.

Robust unsupervised deconvolution of linear motifs characterizes 68 protein modifications at proteome scale.

Sci Rep. 2021 Nov 18;11(1):22490. doi: 10.1038/s41598-021-01971-3.

Cytocompatibility of stabilized black phosphorus nanosheets tailored by directly conjugated polymeric micelles for human breast cancer therapy.

Sci Rep. 2021 Apr 29;11(1):9304. doi: 10.1038/s41598-021-88791-7.

SLiM-Enrich: computational assessment of protein-protein interaction data as a source of domain-motif interactions.

PeerJ. 2018 Oct 31;6:e5858. doi: 10.7717/peerj.5858. eCollection 2018.

Exhaustive search of linear information encoding protein-peptide recognition.

PLoS Comput Biol. 2017 Apr 20;13(4):e1005499. doi: 10.1371/journal.pcbi.1005499. eCollection 2017 Apr.

Regulatory elements in molecular networks.

Wiley Interdiscip Rev Syst Biol Med. 2017 May;9(3). doi: 10.1002/wsbm.1374. Epub 2017 Jan 17.

Evolution of domain-peptide interactions to coadapt specificity and affinity to functional diversity.

Proc Natl Acad Sci U S A. 2016 Jul 5;113(27):E3862-71. doi: 10.1073/pnas.1518469113. Epub 2016 Jun 17.

QSLiMFinder: improved short linear motif prediction using specific query protein data.

Bioinformatics. 2015 Jul 15;31(14):2284-93. doi: 10.1093/bioinformatics/btv155. Epub 2015 Mar 19.

Detecting functional divergence after gene duplication through evolutionary changes in posttranslational regulatory sequences.

PLoS Comput Biol. 2014 Dec 4;10(12):e1003977. doi: 10.1371/journal.pcbi.1003977. eCollection 2014 Dec.

Tissue-aware data integration approach for the inference of pathway interactions in metazoan organisms.

Bioinformatics. 2015 Apr 1;31(7):1093-101. doi: 10.1093/bioinformatics/btu786. Epub 2014 Nov 26.

本文引用的文献

Revealing global regulatory perturbations across human cancers.

Mol Cell. 2009 Dec 11;36(5):900-11. doi: 10.1016/j.molcel.2009.11.016.

Global analysis of the mitochondrial N-proteome identifies a processing peptidase critical for protein stability.

Cell. 2009 Oct 16;139(2):428-39. doi: 10.1016/j.cell.2009.07.045.

Applying mass spectrometry-based proteomics to genetics, genomics and network biology.

Nat Rev Genet. 2009 Sep;10(9):617-27. doi: 10.1038/nrg2633.

KEPE--a motif frequently superimposed on sumoylation sites in metazoan chromatin proteins and transcription factors.

Bioinformatics. 2009 Jan 1;25(1):1-5. doi: 10.1093/bioinformatics/btn594. Epub 2008 Nov 24.

High-quality binary protein interaction map of the yeast interactome network.

Science. 2008 Oct 3;322(5898):104-10. doi: 10.1126/science.1158684. Epub 2008 Aug 21.

Understanding eukaryotic linear motifs and their role in cell signaling and regulation.

Front Biosci. 2008 May 1;13:6580-603. doi: 10.2741/3175.

CompariMotif: quick and easy comparisons of sequence motifs.

Bioinformatics. 2008 May 15;24(10):1307-9. doi: 10.1093/bioinformatics/btn105. Epub 2008 Mar 28.

A careful disorderliness in the proteome: sites for interaction and targets for future therapies.

FEBS Lett. 2008 Apr 9;582(8):1271-5. doi: 10.1016/j.febslet.2008.02.027. Epub 2008 Feb 20.

A computational strategy for the prediction of functional linear peptide motifs in proteins.

Bioinformatics. 2007 Dec 15;23(24):3297-303. doi: 10.1093/bioinformatics/btm524. Epub 2007 Oct 31.

A universal framework for regulatory element discovery across all genomes and data types.

Mol Cell. 2007 Oct 26;28(2):337-50. doi: 10.1016/j.molcel.2007.09.027.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

真核生物中蛋白质调控基序的大规模发现与描述。

Large-scale discovery and characterization of protein regulatory motifs in eukaryotes.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献