SLiMDisc: short, linear motif discovery, correcting for common evolutionary descent.

Authors

Davey Norman E, Shields Denis C, Edwards Richard J

Affiliation

Conway Institute of Biomolecular and Biomedical Sciences, University College Dublin, Dublin 4, Ireland.

Publication details

Nucleic Acids Res. 2006 Jul 19;34(12):3546-54. doi: 10.1093/nar/gkl486. Print 2006.

DOI:10.1093/nar/gkl486

PMID:16855291

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC1524906/

Abstract

Many important interactions of proteins are facilitated by short, linear motifs (SLiMs) within a protein's primary sequence. Our aim was to establish robust methods for discovering putative functional motifs. The strongest evidence for such motifs is obtained when the same motifs occur in unrelated proteins, evolving by convergence. In practice, searches for such motifs are often swamped by motifs shared in related proteins that are identical by descent. Predictions of motifs among sets of biologically related proteins, including those both with and without detectable similarity, were made using the TEIRESIAS algorithm. The number of motif occurrences arising through common evolutionary descent was normalized based on treatment of BLAST local alignments. Motifs were ranked according to a score derived from the product of the normalized number of occurrences and the information content. The method was shown to significantly outperform methods that do not discount evolutionary relatedness when applied to known SLiMs from a subset of the eukaryotic linear motif (ELM) database. An implementation of Multiple Spanning Tree weighting outperformed two other weighting schemes in a variety of settings.
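The ranking idea in the abstract (a score formed from the product of a normalized occurrence count and the information content) can be illustrated with a minimal Python sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the function names, the per-position information measure (log base 20 of the fraction of allowed residues), and the normalization, which simply counts each group of related proteins once rather than applying the Multiple Spanning Tree weighting the abstract mentions.

```python
import math

AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard residues


def information_content(motif_positions, alphabet_size=20):
    """Sum per-position information (assumed measure, not the paper's).

    Each position is the set of residues allowed there. A fully fixed
    position contributes 1.0 and a wildcard (all 20 residues) contributes 0.0.
    """
    total = 0.0
    for allowed in motif_positions:
        total += -math.log(len(allowed) / alphabet_size, alphabet_size)
    return total


def normalized_support(proteins_with_motif, cluster_of):
    """Count each cluster of related proteins once.

    This is a crude stand-in for discounting common descent; cluster_of is a
    hypothetical mapping from protein name to a relatedness cluster label,
    which in practice might be derived from BLAST hits.
    """
    return len({cluster_of[p] for p in proteins_with_motif})


def motif_score(motif_positions, proteins_with_motif, cluster_of):
    """Score = normalized occurrences x information content."""
    support = normalized_support(proteins_with_motif, cluster_of)
    return support * information_content(motif_positions)


# Hypothetical example: motif "P.L" found in three proteins, two of them related.
motif = [{"P"}, AMINO_ACIDS, {"L"}]
clusters = {"protA": 1, "protB": 1, "protC": 2}
score = motif_score(motif, ["protA", "protB", "protC"], clusters)
```

In this toy case the two related proteins count as a single occurrence, so the motif is scored on two independent occurrences instead of three, which is the kind of discounting the abstract describes.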

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa52/1524906/f7eeccad493c/gkl486f1.jpg

Similar articles

SLiMDisc: short, linear motif discovery, correcting for common evolutionary descent.

Nucleic Acids Res. 2006 Jul 19;34(12):3546-54. doi: 10.1093/nar/gkl486. Print 2006.

The SLiMDisc server: short, linear motif discovery in proteins.

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W455-9. doi: 10.1093/nar/gkm400. Epub 2007 Jun 18.

Masking residues using context-specific evolutionary conservation significantly improves short linear motif discovery.

Bioinformatics. 2009 Feb 15;25(4):443-50. doi: 10.1093/bioinformatics/btn664. Epub 2009 Jan 9.

DILIMOT: discovery of linear motifs in proteins.

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W350-5. doi: 10.1093/nar/gkl159.

D-SLIMMER: domain-SLiM interaction motifs miner for sequence based protein-protein interaction data.

J Proteome Res. 2011 Dec 2;10(12):5285-95. doi: 10.1021/pr200312e. Epub 2011 Nov 1.

Attributes of short linear motifs.

Mol Biosyst. 2012 Jan;8(1):268-81. doi: 10.1039/c1mb05231d. Epub 2011 Sep 12.

Discovering sequence motifs.

Methods Mol Biol. 2008;452:231-51. doi: 10.1007/978-1-60327-159-2_12.

Identification of function-associated loop motifs and application to protein function prediction.

Bioinformatics. 2006 Sep 15;22(18):2237-43. doi: 10.1093/bioinformatics/btl382. Epub 2006 Jul 26.

Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure.

BMC Bioinformatics. 2008 Oct 10;9:429. doi: 10.1186/1471-2105-9-429.

Fast model-based protein homology detection without alignment.

Bioinformatics. 2007 Jul 15;23(14):1728-36. doi: 10.1093/bioinformatics/btm247. Epub 2007 May 8.

Cited by

Pangenome Reveals Gene Content Variations and Structural Variants Contributing to Pig Characteristics.

Genomics Proteomics Bioinformatics. 2025 Jan 15;22(6). doi: 10.1093/gpbjnl/qzae081.

Discovery and Characterization of Linear Motif Mediated Protein-Protein Complexes.

Adv Exp Med Biol. 2024;3234:59-71. doi: 10.1007/978-3-031-52193-5_5.

Whole-mitogenome analysis unveils previously undescribed genetic diversity in cane toads across their invasion trajectory.

Ecol Evol. 2024 Mar 3;14(3):e11115. doi: 10.1002/ece3.11115. eCollection 2024 Mar.

The Australasian dingo archetype: de novo chromosome-length genome assembly, DNA methylome, and cranial morphology.

Gigascience. 2023 Mar 20;12. doi: 10.1093/gigascience/giad018. Epub 2023 Mar 28.

Computational Prediction of Protein Intrinsically Disordered Region Related Interactions and Functions.

Genes (Basel). 2023 Feb 8;14(2):432. doi: 10.3390/genes14020432.

The Australasian dingo archetype: chromosome-length genome assembly, DNA methylome, and cranial morphology.

bioRxiv. 2023 Jan 27:2023.01.26.525801. doi: 10.1101/2023.01.26.525801.

Modulating biomolecular condensates: a novel approach to drug discovery.

Nat Rev Drug Discov. 2022 Nov;21(11):841-862. doi: 10.1038/s41573-022-00505-4. Epub 2022 Aug 16.

Transcript- and annotation-guided genome assembly of the European starling.

Mol Ecol Resour. 2022 Nov;22(8):3141-3160. doi: 10.1111/1755-0998.13679. Epub 2022 Jul 18.

Intrinsically disordered proteins play diverse roles in cell signaling.

Cell Commun Signal. 2022 Feb 17;20(1):20. doi: 10.1186/s12964-022-00821-7.

Chromosome-length genome assembly and structural variations of the primal Basenji dog (Canis lupus familiaris) genome.

BMC Genomics. 2021 Mar 16;22(1):188. doi: 10.1186/s12864-021-07493-6.

References

Human protein reference database--2006 update.

Nucleic Acids Res. 2006 Jan 1;34(Database issue):D411-4. doi: 10.1093/nar/gkj141.

The Gene Ontology (GO) project in 2006.

Nucleic Acids Res. 2006 Jan 1;34(Database issue):D322-6. doi: 10.1093/nar/gkj021.

Systematic discovery of new recognition peptides mediating protein interaction networks.

PLoS Biol. 2005 Dec;3(12):e405. doi: 10.1371/journal.pbio.0030405. Epub 2005 Nov 15.

QuasiMotiFinder: protein annotation by searching for evolutionarily conserved motif-like patterns.

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W255-61. doi: 10.1093/nar/gki496.

Linear motifs: evolutionary interaction switches.

FEBS Lett. 2005 Jun 13;579(15):3342-5. doi: 10.1016/j.febslet.2005.04.005. Epub 2005 Apr 18.

Evolutionary distance estimation and fidelity of pair wise sequence alignment.

BMC Bioinformatics. 2005 Apr 19;6:102. doi: 10.1186/1471-2105-6-102.

The Universal Protein Resource (UniProt).

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D154-9. doi: 10.1093/nar/gki070.

ELM server: A new resource for investigating short functional sites in modular eukaryotic proteins.

Nucleic Acids Res. 2003 Jul 1;31(13):3625-30. doi: 10.1093/nar/gkg545.

Multiple sequence alignment with the Clustal series of programs.

Nucleic Acids Res. 2003 Jul 1;31(13):3497-500. doi: 10.1093/nar/gkg500.

Protein C-mannosylation: facts and questions.

Acta Biochim Pol. 2000;47(3):781-9.
