• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于序列和结构比对的特定位置得分矩阵比较。

A comparison of position-specific score matrices based on sequence and structure alignments.

作者信息

Panchenko Anna R, Bryant Stephen H

机构信息

Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA.

出版信息

Protein Sci. 2002 Feb;11(2):361-70. doi: 10.1110/ps.19902.

DOI:10.1110/ps.19902
PMID:11790846
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2373449/
Abstract

Sequence comparison methods based on position-specific score matrices (PSSMs) have proven a useful tool for recognition of the divergent members of a protein family and for annotation of functional sites. Here we investigate one of the factors that affects overall performance of PSSMs in a PSI-BLAST search, the algorithm used to construct the seed alignment upon which the PSSM is based. We compare PSSMs based on alignments constructed by global sequence similarity (ClustalW and ClustalW-pairwise), local sequence similarity (BLAST), and local structure similarity (VAST). To assess performance with respect to identification of conserved functional or structural sites, we examine the accuracy of the three-dimensional molecular models predicted by PSSM-sequence alignments. Using the known structures of those sequences as the standard of truth, we find that model accuracy varies with the algorithm used for seed alignment construction in the pattern local-structure (VAST) > local-sequence (BLAST) > global-sequence (ClustalW). Using structural similarity of query and database proteins as the standard of truth, we find that PSSM recognition sensitivity depends primarily on the diversity of the sequences included in the alignment, with an optimum around 30-50% average pairwise identity. We discuss these observations, and suggest a strategy for constructing seed alignments that optimize PSSM-sequence alignment accuracy and recognition sensitivity.

摘要

基于位置特异性得分矩阵(PSSM)的序列比较方法已被证明是识别蛋白质家族中不同成员以及注释功能位点的有用工具。在此,我们研究了影响PSI-BLAST搜索中PSSM整体性能的一个因素,即用于构建PSSM所基于的种子比对的算法。我们比较了基于通过全局序列相似性(ClustalW和ClustalW成对)、局部序列相似性(BLAST)和局部结构相似性(VAST)构建的比对的PSSM。为了评估在识别保守功能或结构位点方面的性能,我们检查了由PSSM序列比对预测的三维分子模型的准确性。以那些序列的已知结构作为真实标准,我们发现模型准确性随用于种子比对构建的算法而变化,模式为局部结构(VAST)>局部序列(BLAST)>全局序列(ClustalW)。以查询蛋白和数据库蛋白的结构相似性作为真实标准,我们发现PSSM识别灵敏度主要取决于比对中包含的序列的多样性,平均成对同一性约为30 - 50%时达到最佳。我们讨论了这些观察结果,并提出了一种构建种子比对的策略,以优化PSSM序列比对准确性和识别灵敏度。

相似文献

1
A comparison of position-specific score matrices based on sequence and structure alignments.基于序列和结构比对的特定位置得分矩阵比较。
Protein Sci. 2002 Feb;11(2):361-70. doi: 10.1110/ps.19902.
2
PSSM-based prediction of DNA binding sites in proteins.基于位置特异性得分矩阵的蛋白质中DNA结合位点预测
BMC Bioinformatics. 2005 Feb 19;6:33. doi: 10.1186/1471-2105-6-33.
3
Use of multiple profiles corresponding to a sequence alignment enables effective detection of remote homologues.使用与序列比对相对应的多个图谱能够有效地检测远源同源物。
Bioinformatics. 2005 Jun 15;21(12):2821-6. doi: 10.1093/bioinformatics/bti432. Epub 2005 Apr 7.
4
Accuracy of structure-based sequence alignment of automatic methods.自动方法的基于结构的序列比对准确性。
BMC Bioinformatics. 2007 Sep 20;8:355. doi: 10.1186/1471-2105-8-355.
5
IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices.IMPALA:将蛋白质序列与一组由PSI-BLAST构建的位置特异性得分矩阵进行匹配。
Bioinformatics. 1999 Dec;15(12):1000-11. doi: 10.1093/bioinformatics/15.12.1000.
6
Fast model-based protein homology detection without alignment.基于快速模型的无需比对的蛋白质同源性检测。
Bioinformatics. 2007 Jul 15;23(14):1728-36. doi: 10.1093/bioinformatics/btm247. Epub 2007 May 8.
7
Evaluation of PSI-BLAST alignment accuracy in comparison to structural alignments.与结构比对相比,PSI-BLAST比对准确性的评估。
Protein Sci. 2000 Nov;9(11):2278-84. doi: 10.1110/ps.9.11.2278.
8
A comparison of scoring functions for protein sequence profile alignment.蛋白质序列谱比对评分函数的比较
Bioinformatics. 2004 May 22;20(8):1301-8. doi: 10.1093/bioinformatics/bth090. Epub 2004 Feb 12.
9
PASS2: an automated database of protein alignments organised as structural superfamilies.PASS2:一个以结构超家族形式组织的蛋白质比对自动化数据库。
BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35.
10
OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy.OXBench:一种用于评估蛋白质多序列比对准确性的基准。
BMC Bioinformatics. 2003 Oct 10;4:47. doi: 10.1186/1471-2105-4-47.

引用本文的文献

1
Exploring the Promoter Generation and Prediction of spp. Based on GAN and Multi-Model Fusion Methods.基于生成对抗网络和多模型融合方法探索spp.的启动子生成与预测
Int J Mol Sci. 2024 Dec 6;25(23):13137. doi: 10.3390/ijms252313137.
2
Kabirian-based optinalysis: A conceptually grounded framework for symmetry/asymmetry, similarity/dissimilarity and identity/unidentity estimations in mathematical structures and biological sequences.基于卡比里安的可选分析:一个在数学结构和生物序列中用于对称性/不对称性、相似性/不相似性以及同一性/非同一性估计的概念性基础框架。
MethodsX. 2023 Oct 1;11:102400. doi: 10.1016/j.mex.2023.102400. eCollection 2023 Dec.
3
A gatekeeper chaperone complex directs translocator secretion during type three secretion.一种守门人伴侣蛋白复合体在三型分泌过程中指导转运体的分泌。
PLoS Pathog. 2014 Nov 6;10(11):e1004498. doi: 10.1371/journal.ppat.1004498. eCollection 2014 Nov.
4
Current progress in Structure-Based Rational Drug Design marks a new mindset in drug discovery.基于结构的合理药物设计的当前进展标志着药物发现领域的一种新思维方式。
Comput Struct Biotechnol J. 2013 Apr 2;5:e201302011. doi: 10.5936/csbj.201302011. eCollection 2013.
5
Domain enhanced lookup time accelerated BLAST.基于域名的快速检索 BLAST。
Biol Direct. 2012 Apr 17;7:12. doi: 10.1186/1745-6150-7-12.
6
Protein sequence alignment with family-specific amino acid similarity matrices.使用家族特异性氨基酸相似性矩阵进行蛋白质序列比对。
BMC Res Notes. 2011 Aug 16;4:296. doi: 10.1186/1756-0500-4-296.
7
Computational protein design: validation and possible relevance as a tool for homology searching and fold recognition.计算蛋白质设计:验证及其作为同源搜索和折叠识别工具的可能相关性。
PLoS One. 2010 May 5;5(5):e10410. doi: 10.1371/journal.pone.0010410.
8
Structural and functional studies indicate that Shigella VirA is not a protease and does not directly destabilize microtubules.结构和功能研究表明,志贺氏菌VirA不是一种蛋白酶,也不会直接破坏微管的稳定性。
Biochemistry. 2008 Sep 30;47(39):10241-3. doi: 10.1021/bi801533k. Epub 2008 Sep 3.
9
Structural and kinetic studies of induced fit in xylulose kinase from Escherichia coli.大肠杆菌木酮糖激酶诱导契合的结构与动力学研究
J Mol Biol. 2007 Jan 19;365(3):783-98. doi: 10.1016/j.jmb.2006.10.068. Epub 2006 Oct 25.
10
Application of protein structure alignments to iterated hidden Markov model protocols for structure prediction.蛋白质结构比对在用于结构预测的迭代隐马尔可夫模型协议中的应用。
BMC Bioinformatics. 2006 Sep 14;7:410. doi: 10.1186/1471-2105-7-410.

本文引用的文献

1
The Protein Data Bank and the challenge of structural genomics.蛋白质数据库与结构基因组学的挑战。
Nat Struct Biol. 2000 Nov;7 Suppl:957-9. doi: 10.1038/80734.
2
T-Coffee: A novel method for fast and accurate multiple sequence alignment.T-Coffee:一种用于快速准确的多序列比对的新方法。
J Mol Biol. 2000 Sep 8;302(1):205-17. doi: 10.1006/jmbi.2000.4042.
3
Enhanced genome annotation using structural profiles in the program 3D-PSSM.在程序3D-PSSM中使用结构概况增强基因组注释。
J Mol Biol. 2000 Jun 2;299(2):499-520. doi: 10.1006/jmbi.2000.3741.
4
Large-scale comparison of protein sequence alignment algorithms with structure alignments.蛋白质序列比对算法与结构比对的大规模比较。
Proteins. 2000 Jul 1;40(1):6-22. doi: 10.1002/(sici)1097-0134(20000701)40:1<6::aid-prot30>3.0.co;2-7.
5
Comparison of sequence profiles. Strategies for structural predictions using sequence information.序列图谱比较。利用序列信息进行结构预测的策略。
Protein Sci. 2000 Feb;9(2):232-41. doi: 10.1110/ps.9.2.232.
6
Combination of threading potentials and sequence profiles improves fold recognition.穿线势能与序列谱相结合可提高折叠识别能力。
J Mol Biol. 2000 Mar 10;296(5):1319-31. doi: 10.1006/jmbi.2000.3541.
7
Evolution of protein sequences and structures.蛋白质序列与结构的进化
J Mol Biol. 1999 Aug 27;291(4):977-95. doi: 10.1006/jmbi.1999.2972.
8
A protein taxonomy based on secondary structure.一种基于二级结构的蛋白质分类法。
Nat Struct Biol. 1999 Jul;6(7):672-82. doi: 10.1038/10728.
9
A comprehensive comparison of multiple sequence alignment programs.多个序列比对程序的全面比较。
Nucleic Acids Res. 1999 Jul 1;27(13):2682-90. doi: 10.1093/nar/27.13.2682.
10
Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searches.通过迭代数据库搜索获取有关蛋白质的重要结构、功能和进化信息。
J Mol Biol. 1999 Apr 16;287(5):1023-40. doi: 10.1006/jmbi.1999.2653.