A comparison of position-specific score matrices based on sequence and structure alignments.

Author information

Panchenko Anna R, Bryant Stephen H

Affiliation

Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA.

Publication information

Protein Sci. 2002 Feb;11(2):361-70. doi: 10.1110/ps.19902.

DOI:10.1110/ps.19902

PMID:11790846

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2373449/

Abstract

Sequence comparison methods based on position-specific score matrices (PSSMs) have proven a useful tool for recognition of the divergent members of a protein family and for annotation of functional sites. Here we investigate one of the factors that affects overall performance of PSSMs in a PSI-BLAST search, the algorithm used to construct the seed alignment upon which the PSSM is based. We compare PSSMs based on alignments constructed by global sequence similarity (ClustalW and ClustalW-pairwise), local sequence similarity (BLAST), and local structure similarity (VAST). To assess performance with respect to identification of conserved functional or structural sites, we examine the accuracy of the three-dimensional molecular models predicted by PSSM-sequence alignments. Using the known structures of those sequences as the standard of truth, we find that model accuracy varies with the algorithm used for seed alignment construction in the pattern local-structure (VAST) > local-sequence (BLAST) > global-sequence (ClustalW). Using structural similarity of query and database proteins as the standard of truth, we find that PSSM recognition sensitivity depends primarily on the diversity of the sequences included in the alignment, with an optimum around 30-50% average pairwise identity. We discuss these observations, and suggest a strategy for constructing seed alignments that optimize PSSM-sequence alignment accuracy and recognition sensitivity.
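
The diversity measure mentioned above, average pairwise identity of a seed alignment, can be illustrated with a minimal sketch. The abstract does not say how identity was computed, so the definition used here (identical residues divided by columns where both sequences have a residue) is an assumption, and the function names `pairwise_identity`, `average_pairwise_identity`, and `in_reported_optimum` are hypothetical helpers rather than part of PSI-BLAST, ClustalW, or VAST.

```python
from itertools import combinations


def pairwise_identity(a, b, gap="-"):
    """Percent identity over columns where both aligned sequences have a residue.

    Assumed definition; the paper may normalize differently.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns in which neither sequence has a gap.
    aligned = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not aligned:
        return 0.0
    matches = sum(1 for x, y in aligned if x == y)
    return 100.0 * matches / len(aligned)


def average_pairwise_identity(alignment):
    """Mean percent identity over all unordered pairs of rows in the alignment."""
    pairs = list(combinations(alignment, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(pairwise_identity(a, b) for a, b in pairs) / len(pairs)


def in_reported_optimum(alignment, low=30.0, high=50.0):
    """Check whether the alignment falls in the roughly 30-50% range from the abstract."""
    return low <= average_pairwise_identity(alignment) <= high


if __name__ == "__main__":
    # Toy alignment, not real protein data.
    seed = ["ACDE", "ACFG", "AHIK"]
    print(round(average_pairwise_identity(seed), 2))
    print(in_reported_optimum(seed))
```

A check like this could be used to screen candidate seed alignments before building a PSSM, although the 30-50% bounds are only the approximate optimum reported in the abstract, not a hard threshold.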

Similar articles

PSSM-based prediction of DNA binding sites in proteins.

BMC Bioinformatics. 2005 Feb 19;6:33. doi: 10.1186/1471-2105-6-33.

Use of multiple profiles corresponding to a sequence alignment enables effective detection of remote homologues.

Bioinformatics. 2005 Jun 15;21(12):2821-6. doi: 10.1093/bioinformatics/bti432. Epub 2005 Apr 7.

Accuracy of structure-based sequence alignment of automatic methods.

BMC Bioinformatics. 2007 Sep 20;8:355. doi: 10.1186/1471-2105-8-355.

IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices.

Bioinformatics. 1999 Dec;15(12):1000-11. doi: 10.1093/bioinformatics/15.12.1000.

Fast model-based protein homology detection without alignment.

Bioinformatics. 2007 Jul 15;23(14):1728-36. doi: 10.1093/bioinformatics/btm247. Epub 2007 May 8.

Evaluation of PSI-BLAST alignment accuracy in comparison to structural alignments.

Protein Sci. 2000 Nov;9(11):2278-84. doi: 10.1110/ps.9.11.2278.

A comparison of scoring functions for protein sequence profile alignment.

Bioinformatics. 2004 May 22;20(8):1301-8. doi: 10.1093/bioinformatics/bth090. Epub 2004 Feb 12.

PASS2: an automated database of protein alignments organised as structural superfamilies.

BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35.

OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy.

BMC Bioinformatics. 2003 Oct 10;4:47. doi: 10.1186/1471-2105-4-47.

Cited by

Exploring the Promoter Generation and Prediction of spp. Based on GAN and Multi-Model Fusion Methods.

Int J Mol Sci. 2024 Dec 6;25(23):13137. doi: 10.3390/ijms252313137.

Kabirian-based optinalysis: A conceptually grounded framework for symmetry/asymmetry, similarity/dissimilarity and identity/unidentity estimations in mathematical structures and biological sequences.

MethodsX. 2023 Oct 1;11:102400. doi: 10.1016/j.mex.2023.102400. eCollection 2023 Dec.

A gatekeeper chaperone complex directs translocator secretion during type three secretion.

PLoS Pathog. 2014 Nov 6;10(11):e1004498. doi: 10.1371/journal.ppat.1004498. eCollection 2014 Nov.

Current progress in Structure-Based Rational Drug Design marks a new mindset in drug discovery.

Comput Struct Biotechnol J. 2013 Apr 2;5:e201302011. doi: 10.5936/csbj.201302011. eCollection 2013.

Domain enhanced lookup time accelerated BLAST.

Biol Direct. 2012 Apr 17;7:12. doi: 10.1186/1745-6150-7-12.

Protein sequence alignment with family-specific amino acid similarity matrices.

BMC Res Notes. 2011 Aug 16;4:296. doi: 10.1186/1756-0500-4-296.

Computational protein design: validation and possible relevance as a tool for homology searching and fold recognition.

PLoS One. 2010 May 5;5(5):e10410. doi: 10.1371/journal.pone.0010410.

Structural and functional studies indicate that Shigella VirA is not a protease and does not directly destabilize microtubules.

Biochemistry. 2008 Sep 30;47(39):10241-3. doi: 10.1021/bi801533k. Epub 2008 Sep 3.

Structural and kinetic studies of induced fit in xylulose kinase from Escherichia coli.

J Mol Biol. 2007 Jan 19;365(3):783-98. doi: 10.1016/j.jmb.2006.10.068. Epub 2006 Oct 25.

Application of protein structure alignments to iterated hidden Markov model protocols for structure prediction.

BMC Bioinformatics. 2006 Sep 14;7:410. doi: 10.1186/1471-2105-7-410.
