• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

准运动发现器:通过搜索进化上保守的基序样模式进行蛋白质注释。

QuasiMotiFinder: protein annotation by searching for evolutionarily conserved motif-like patterns.

作者信息

Gutman Roee, Berezin Carine, Wollman Roy, Rosenberg Yossi, Ben-Tal Nir

机构信息

Department of Biochemistry, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel.

出版信息

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W255-61. doi: 10.1093/nar/gki496.

DOI:10.1093/nar/gki496
PMID:15980465
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1160256/
Abstract

Sequence signature databases such as PROSITE, which include amino acid segments that are indicative of a protein's function, are useful for protein annotation. Lamentably, the annotation is not always accurate. A signature may be falsely detected in a protein that does not carry out the associated function (false positive prediction, FP) or may be overlooked in a protein that does carry out the function (false negative prediction, FN). A new approach has emerged in which a signature is replaced with a sequence profile, calculated based on multiple sequence alignment (MSA) of homologous proteins that share the same function. This approach, which is superior to the simple pattern search, essentially searches with the sequence of the query protein against an MSA library. We suggest here an alternative approach, implemented in the QuasiMotiFinder web server (http://quasimotifinder.tau.ac.il/), which is based on a search with an MSA of homologous query proteins against the original PROSITE signatures. The explicit use of the average evolutionary conservation of the signature in the query proteins significantly reduces the rate of FP prediction compared with the simple pattern search. QuasiMotiFinder also has a reduced rate of FN prediction compared with simple pattern searches, since the traditional search for precise signatures has been replaced by a permissive search for signature-like patterns that are physicochemically similar to known signatures. Overall, QuasiMotiFinder and the profile search are comparable to each other in terms of performance. They are also complementary to each other in that signatures that are falsely detected in (or overlooked by) one may be correctly detected by the other.

摘要

诸如PROSITE之类的序列特征数据库,其中包含指示蛋白质功能的氨基酸片段,对蛋白质注释很有用。遗憾的是,注释并不总是准确的。一个特征可能在不执行相关功能的蛋白质中被错误检测到(假阳性预测,FP),或者可能在执行该功能的蛋白质中被忽略(假阴性预测,FN)。一种新方法出现了,即用基于具有相同功能的同源蛋白质的多序列比对(MSA)计算的序列概况来取代特征。这种方法优于简单的模式搜索,本质上是用查询蛋白质的序列与一个MSA库进行比对。我们在此提出一种替代方法,在QuasiMotiFinder网络服务器(http://quasimotifinder.tau.ac.il/)中实现,该方法基于用同源查询蛋白质的MSA与原始PROSITE特征进行比对。与简单的模式搜索相比,明确使用查询蛋白质中特征的平均进化保守性可显著降低FP预测率。与简单模式搜索相比,QuasiMotiFinder的FN预测率也有所降低,因为传统的精确特征搜索已被对与已知特征在物理化学上相似的特征样模式的宽松搜索所取代。总体而言,QuasiMotiFinder和概况搜索在性能方面相互可比。它们也相互补充,因为在一个中被错误检测到(或被忽略)的特征可能会被另一个正确检测到。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/129a/1160256/702c54233b3f/gki496f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/129a/1160256/df5855445695/gki496f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/129a/1160256/3f05b9f5d21e/gki496f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/129a/1160256/702c54233b3f/gki496f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/129a/1160256/df5855445695/gki496f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/129a/1160256/3f05b9f5d21e/gki496f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/129a/1160256/702c54233b3f/gki496f3.jpg

相似文献

1
QuasiMotiFinder: protein annotation by searching for evolutionarily conserved motif-like patterns.准运动发现器:通过搜索进化上保守的基序样模式进行蛋白质注释。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W255-61. doi: 10.1093/nar/gki496.
2
Scan2S: increasing the precision of PROSITE pattern motifs using secondary structure constraints.Scan2S:利用二级结构约束提高PROSITE模式基序的精度
Proteins. 2008 Sep;72(4):1138-47. doi: 10.1002/prot.22008.
3
SCANMOT: searching for similar sequences using a simultaneous scan of multiple sequence motifs.SCANMOT:通过同时扫描多个序列基序来搜索相似序列。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W274-6. doi: 10.1093/nar/gki493.
4
Designing patterns for profile HMM search.设计用于隐马尔可夫模型轮廓搜索的模式。
Bioinformatics. 2007 Jan 15;23(2):e36-43. doi: 10.1093/bioinformatics/btl323.
5
Recent improvements to the PROSITE database.PROSITE数据库的近期改进。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D134-7. doi: 10.1093/nar/gkh044.
6
WebAllergen: a web server for predicting allergenic proteins.网络变应原:一个用于预测变应原性蛋白质的网络服务器。
Bioinformatics. 2005 May 15;21(10):2570-1. doi: 10.1093/bioinformatics/bti356. Epub 2005 Mar 3.
7
DOMAC: an accurate, hybrid protein domain prediction server.DOMAC:一个准确的混合蛋白质结构域预测服务器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W354-6. doi: 10.1093/nar/gkm390. Epub 2007 Jun 6.
8
NdPASA: a pairwise sequence alignment server for distantly related proteins.NdPASA:用于远缘相关蛋白质的双序列比对服务器。
Bioinformatics. 2005 Oct 1;21(19):3803-5. doi: 10.1093/bioinformatics/bti619. Epub 2005 Aug 16.
9
Fast model-based protein homology detection without alignment.基于快速模型的无需比对的蛋白质同源性检测。
Bioinformatics. 2007 Jul 15;23(14):1728-36. doi: 10.1093/bioinformatics/btm247. Epub 2007 May 8.
10
HHsenser: exhaustive transitive profile search using HMM-HMM comparison.HHsenser:使用隐马尔可夫模型-隐马尔可夫模型比较进行详尽的传递性轮廓搜索。
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W374-8. doi: 10.1093/nar/gkl195.

引用本文的文献

1
PSSMSearch: a server for modeling, visualization, proteome-wide discovery and annotation of protein motif specificity determinants.PSSMSearch:一个用于建模、可视化、蛋白质基序特异性决定因素的全蛋白质组发现和注释的服务器。
Nucleic Acids Res. 2018 Jul 2;46(W1):W235-W241. doi: 10.1093/nar/gky426.
2
Novel linear motif filtering protocol reveals the role of the LC8 dynein light chain in the Hippo pathway.新型线性基序筛选方案揭示了LC8动力蛋白轻链在Hippo信号通路中的作用。
PLoS Comput Biol. 2017 Dec 14;13(12):e1005885. doi: 10.1371/journal.pcbi.1005885. eCollection 2017 Dec.
3
SLiMSearch: a framework for proteome-wide discovery and annotation of functional modules in intrinsically disordered regions.

本文引用的文献

1
eBLOCKs: enumerating conserved protein blocks to achieve maximal sensitivity and specificity.eBLOCKs:枚举保守蛋白质模块以实现最大灵敏度和特异性。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D178-82. doi: 10.1093/nar/gki060.
2
The Pfam protein families database.Pfam蛋白质家族数据库。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D138-41. doi: 10.1093/nar/gkh121.
3
Recent improvements to the PROSITE database.PROSITE数据库的近期改进。
SLiMSearch:一种用于在无序区域中进行蛋白质组功能模块的全局发现和注释的框架。
Nucleic Acids Res. 2017 Jul 3;45(W1):W464-W469. doi: 10.1093/nar/gkx238.
4
Redefining the BH3 Death Domain as a 'Short Linear Motif'.将BH3死亡结构域重新定义为“短线性基序”。
Trends Biochem Sci. 2015 Dec;40(12):736-748. doi: 10.1016/j.tibs.2015.09.007. Epub 2015 Nov 3.
5
SeqNLS: nuclear localization signal prediction based on frequent pattern mining and linear motif scoring.SeqNLS:基于频繁模式挖掘和线性基序评分的核定位信号预测
PLoS One. 2013 Oct 29;8(10):e76864. doi: 10.1371/journal.pone.0076864. eCollection 2013.
6
CLIPS-1D: analysis of multiple sequence alignments to deduce for residue-positions a role in catalysis, ligand-binding, or protein structure.CLIPS-1D:分析多重序列比对,推断残基位置在催化、配体结合或蛋白质结构中的作用。
BMC Bioinformatics. 2012 Apr 5;13:55. doi: 10.1186/1471-2105-13-55.
7
SLiMSearch 2.0: biological context for short linear motifs in proteins.SLiMSearch 2.0:蛋白质中短线性基序的生物学背景。
Nucleic Acids Res. 2011 Jul;39(Web Server issue):W56-60. doi: 10.1093/nar/gkr402. Epub 2011 May 26.
8
SLiMFinder: a web server to find novel, significantly over-represented, short protein motifs.SLiMFinder:一个用于发现新颖的、显著过度表达的短蛋白基序的网络服务器。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W534-9. doi: 10.1093/nar/gkq440. Epub 2010 May 23.
9
Novel SPG11 mutations in Asian kindreds and disruption of spatacsin function in the zebrafish.新型 SPG11 突变与斑马鱼中 spataxsin 功能障碍
Neurogenetics. 2010 Oct;11(4):379-89. doi: 10.1007/s10048-010-0243-8.
10
Evidence for the concerted evolution between short linear protein motifs and their flanking regions.短线性蛋白质基序与其侧翼区域之间协同进化的证据。
PLoS One. 2009 Jul 8;4(7):e6052. doi: 10.1371/journal.pone.0006052.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D134-7. doi: 10.1093/nar/gkh044.
4
ELM server: A new resource for investigating short functional sites in modular eukaryotic proteins.ELM服务器:一种用于研究模块化真核生物蛋白质中短功能位点的新资源。
Nucleic Acids Res. 2003 Jul 1;31(13):3625-30. doi: 10.1093/nar/gkg545.
5
The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.2003年的SWISS-PROT蛋白质知识库及其补充TrEMBL。
Nucleic Acids Res. 2003 Jan 1;31(1):365-70. doi: 10.1093/nar/gkg095.
6
Rate4Site: an algorithmic tool for the identification of functional regions in proteins by surface mapping of evolutionary determinants within their homologues.Rate4Site:一种通过蛋白质同源物中进化决定因素的表面映射来识别蛋白质功能区域的算法工具。
Bioinformatics. 2002;18 Suppl 1:S71-7. doi: 10.1093/bioinformatics/18.suppl_1.s71.
7
The EMOTIF database.EMOTIF数据库。
Nucleic Acids Res. 2001 Jan 1;29(1):202-4. doi: 10.1093/nar/29.1.202.
8
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.空位BLAST和位置特异性迭代BLAST:新一代蛋白质数据库搜索程序。
Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389.
9
Subtilases: the superfamily of subtilisin-like serine proteases.枯草杆菌蛋白酶:枯草杆菌蛋白酶样丝氨酸蛋白酶超家族
Protein Sci. 1997 Mar;6(3):501-23. doi: 10.1002/pro.5560060301.
10
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.CLUSTAL W:通过序列加权、位置特异性空位罚分和权重矩阵选择提高渐进多序列比对的灵敏度。
Nucleic Acids Res. 1994 Nov 11;22(22):4673-80. doi: 10.1093/nar/22.22.4673.