• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Scan2S:利用二级结构约束提高PROSITE模式基序的精度

Scan2S: increasing the precision of PROSITE pattern motifs using secondary structure constraints.

作者信息

Skrabanek Lucy, Niv Masha Y

机构信息

Department of Physiology and Biophysics, Weill Medical College of Cornell University, New York, New York 10021, USA.

出版信息

Proteins. 2008 Sep;72(4):1138-47. doi: 10.1002/prot.22008.

DOI:10.1002/prot.22008
PMID:18320586
Abstract

Sequence signature databases such as PROSITE, which include protein pattern motifs indicative of a protein's function, are widely used for function prediction studies, cellular localization annotation, and sequence classification. Correct annotation relies on high precision of the motifs. We present a new and general approach for increasing the precision of established protein pattern motifs by including secondary structure constraints (SSCs). We use Scan2S, the first sequence motif-scanning program to optionally include SSCs, to augment PROSITE pattern motifs. The constraints were derived from either the DSSP secondary structure assignment or the PSIPRED predictions for PROSITE-documented true positive hits. The secondary structure-augmented motifs were scanned against all SwissProt sequences, for which secondary structure predictions were precalculated. Against this dataset, motifs with PSIPRED-derived SSCs exhibited improved performance over motifs with DSSP-derived constraints. The precision of 763 of the 782 PSIPRED-augmented motifs remained unchanged or increased compared to the original motifs; 26 motifs showed an absolute precision increase of 10-30%. We provide the complete set of augmented motifs and the Scan2S program at http://physiology.med.cornell.edu/go/scan2s. Our results suggest a general protocol for increasing the precision of protein pattern detection via the inclusion of SSCs.

摘要

诸如PROSITE之类的序列特征数据库,其中包含指示蛋白质功能的蛋白质模式基序,被广泛用于功能预测研究、细胞定位注释和序列分类。正确的注释依赖于基序的高精度。我们提出了一种新的通用方法,通过纳入二级结构约束(SSC)来提高已建立的蛋白质模式基序的精度。我们使用Scan2S(第一个可选择性纳入SSC的序列基序扫描程序)来增强PROSITE模式基序。这些约束要么来自DSSP二级结构分配,要么来自对PROSITE记录的真阳性命中的PSIPRED预测。针对所有预先计算了二级结构预测的SwissProt序列扫描二级结构增强的基序。在这个数据集上,具有PSIPRED衍生的SSC的基序比具有DSSP衍生约束的基序表现出更好的性能。与原始基序相比,782个PSIPRED增强基序中的763个的精度保持不变或提高;26个基序的绝对精度提高了10 - 30%。我们在http://physiology.med.cornell.edu/go/scan2s上提供了完整的增强基序集和Scan2S程序。我们的结果表明了一种通过纳入SSC来提高蛋白质模式检测精度的通用方案。

相似文献

1
Scan2S: increasing the precision of PROSITE pattern motifs using secondary structure constraints.Scan2S:利用二级结构约束提高PROSITE模式基序的精度
Proteins. 2008 Sep;72(4):1138-47. doi: 10.1002/prot.22008.
2
QuasiMotiFinder: protein annotation by searching for evolutionarily conserved motif-like patterns.准运动发现器:通过搜索进化上保守的基序样模式进行蛋白质注释。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W255-61. doi: 10.1093/nar/gki496.
3
Motif-based fold assignment.基于基序的折叠分配
Protein Sci. 2001 Dec;10(12):2460-9. doi: 10.1110/ps.14401.
4
Genome-wide enzyme annotation with precision control: catalytic families (CatFam) databases.具有精确控制的全基因组酶注释:催化家族(CatFam)数据库。
Proteins. 2009 Feb 1;74(2):449-60. doi: 10.1002/prot.22167.
5
Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure.
BMC Bioinformatics. 2008 Oct 10;9:429. doi: 10.1186/1471-2105-9-429.
6
Mining super-secondary structure motifs from 3d protein structures: a sequence order independent approach.从三维蛋白质结构中挖掘超二级结构基序:一种与序列顺序无关的方法。
Genome Inform. 2007;19:15-26.
7
Exploring an alignment free approach for protein classification and structural class prediction.探索一种用于蛋白质分类和结构类别预测的无比对方法。
Biochimie. 2008 Apr;90(4):615-25. doi: 10.1016/j.biochi.2007.11.004. Epub 2007 Nov 19.
8
Discovering sequence motifs.发现序列基序。
Methods Mol Biol. 2008;452:231-51. doi: 10.1007/978-1-60327-159-2_12.
9
ScanProsite: a reference implementation of a PROSITE scanning tool.ScanProsite:PROSITE扫描工具的参考实现。
Appl Bioinformatics. 2002;1(2):107-8.
10
DILIMOT: discovery of linear motifs in proteins.DILIMOT:蛋白质中线性基序的发现
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W350-5. doi: 10.1093/nar/gkl159.

引用本文的文献

1
Identification of GATC- and CCGG-recognizing Type II REases and their putative specificity-determining positions using Scan2S--a novel motif scan algorithm with optional secondary structure constraints.使用Scan2S——一种具有可选二级结构约束的新型基序扫描算法,鉴定识别GATC和CCGG的II型限制性内切酶及其假定的特异性决定位点。
Proteins. 2008 May 1;71(2):631-40. doi: 10.1002/prot.21777.