Alignment of multiple protein structures based on sequence and structure features.

Author information

Madhusudhan M S, Webb Benjamin M, Marti-Renom Marc A, Eswar Narayanan, Sali Andrej

Affiliation

Department of Bioengineering and Therapeutic Sciences, University of California at San Francisco, San Francisco, CA 94158, USA.

Publication information

Protein Eng Des Sel. 2009 Sep;22(9):569-74. doi: 10.1093/protein/gzp040. Epub 2009 Jul 8.

Abstract

Comparing the structures of proteins is crucial to gaining insight into protein evolution and function. Here, we align the sequences of multiple protein structures by a dynamic programming optimization of a scoring function that is a sum of an affine gap penalty and terms dependent on various sequence and structure features (SALIGN). The features include amino acid residue type, residue position, residue accessible surface area, residue secondary structure state and the conformation of a short segment centered on the residue. The multiple alignment is built by following the 'guide' tree constructed from the matrix of all pairwise protein alignment scores. Importantly, the method does not depend on the exact values of various parameters, such as feature weights and gap penalties, because the optimal alignment across a range of parameter values is found. Using multiple structure alignments in the HOMSTRAD database, SALIGN was benchmarked against MUSTANG for multiple alignments as well as against TM-align and CE for pairwise alignments. On the average, SALIGN produces a 15% improvement in structural overlap over HOMSTRAD and 14% over MUSTANG, and yields more equivalent structural positions than TM-align and CE in 90% and 95% of cases, respectively. The utility of accurate multiple structure alignment is illustrated by its application to comparative protein structure modeling.
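The scoring scheme described in the abstract can be illustrated with a small sketch. This is not the SALIGN implementation: the `Residue` fields, the per-feature similarity terms, the weights, and the gap values are all assumptions chosen for illustration, and only three of the five features are represented. It computes the optimal global alignment score for one pair of proteins by dynamic programming with an affine gap penalty (the standard Gotoh recurrences), where the match score is a weighted sum of feature terms.

```python
from dataclasses import dataclass

NEG_INF = float("-inf")


@dataclass
class Residue:
    aa: str        # one-letter residue type
    ss: str        # secondary structure state, e.g. "H", "E", "C"
    access: float  # relative accessible surface area in [0, 1]


def match_score(a, b, weights):
    """Weighted sum of per-feature similarity terms (illustrative only)."""
    s_type = 1.0 if a.aa == b.aa else -1.0
    s_ss = 1.0 if a.ss == b.ss else -1.0
    s_acc = 1.0 - 2.0 * abs(a.access - b.access)
    return (weights["type"] * s_type
            + weights["ss"] * s_ss
            + weights["access"] * s_acc)


def align_score(x, y, weights, gap_open=-2.0, gap_extend=-0.5):
    """Optimal global alignment score with affine gaps.

    A gap of length k costs gap_open + (k - 1) * gap_extend.
    """
    n, m = len(x), len(y)
    # M: x[i-1] aligned to y[j-1]
    # X: x[i-1] aligned to a gap; Y: y[j-1] aligned to a gap
    M = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    X = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0.0
    for i in range(1, n + 1):
        X[i][0] = gap_open + (i - 1) * gap_extend
    for j in range(1, m + 1):
        Y[0][j] = gap_open + (j - 1) * gap_extend

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match_score(x[i - 1], y[j - 1], weights)
            M[i][j] = s + max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1])
            X[i][j] = max(M[i - 1][j] + gap_open,
                          X[i - 1][j] + gap_extend,
                          Y[i - 1][j] + gap_open)
            Y[i][j] = max(M[i][j - 1] + gap_open,
                          Y[i][j - 1] + gap_extend,
                          X[i][j - 1] + gap_open)
    return max(M[n][m], X[n][m], Y[n][m])


# Hypothetical three-residue protein and a copy missing the middle residue.
weights = {"type": 1.0, "ss": 1.0, "access": 1.0}
p = [Residue("A", "H", 0.2), Residue("G", "C", 0.8), Residue("L", "E", 0.5)]
q = [p[0], p[2]]
self_score = align_score(p, p, weights)
gapped_score = align_score(p, q, weights)
```

In a multiple alignment of the kind the abstract describes, scores like these for every pair of proteins would fill the matrix from which the guide tree is built. Rerunning the alignment over a grid of weights and gap penalties is one simple way to explore sensitivity to parameter values, although the abstract does not specify how SALIGN searches that range.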

Similar articles

1
Alignment of multiple protein structures based on sequence and structure features.
Protein Eng Des Sel. 2009 Sep;22(9):569-74. doi: 10.1093/protein/gzp040. Epub 2009 Jul 8.
2
Adaptive Smith-Waterman residue match seeding for protein structural alignment.
Proteins. 2013 Oct;81(10):1823-39. doi: 10.1002/prot.24327. Epub 2013 Aug 19.
3
mTM-align: an algorithm for fast and accurate multiple protein structure alignment.
Bioinformatics. 2018 May 15;34(10):1719-1725. doi: 10.1093/bioinformatics/btx828.
4
SALIGN: a web server for alignment of multiple protein sequences and structures.
Bioinformatics. 2012 Aug 1;28(15):2072-3. doi: 10.1093/bioinformatics/bts302. Epub 2012 May 21.
5
Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score.
BMC Bioinformatics. 2008 Dec 12;9:531. doi: 10.1186/1471-2105-9-531.
6
MUSTANG: a multiple structural alignment algorithm.
Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921.
7
SE: an algorithm for deriving sequence alignment from a pair of superimposed structures.
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2105-10-S1-S4.
8
Large-scale comparison of protein sequence alignment algorithms with structure alignments.
Proteins. 2000 Jul 1;40(1):6-22. doi: 10.1002/(sici)1097-0134(20000701)40:1<6::aid-prot30>3.0.co;2-7.
9
CAB-Align: A Flexible Protein Structure Alignment Method Based on the Residue-Residue Contact Area.
PLoS One. 2015 Oct 26;10(10):e0141440. doi: 10.1371/journal.pone.0141440. eCollection 2015.
10
SAD--a normalized structural alignment database: improving sequence-structure alignments.
Bioinformatics. 2004 Oct 12;20(15):2333-44. doi: 10.1093/bioinformatics/bth244. Epub 2004 Apr 15.

Cited by

1
ASMC: investigating the amino acid diversity of enzyme active sites.
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf307.
2
Boosting the Full Potential of PyMOL with Structural Biology Plugins.
Biomolecules. 2022 Nov 27;12(12):1764. doi: 10.3390/biom12121764.
3
Exfoliative Toxin E, Oligomeric State and Flip of P186: Implications for Its Action Mechanism.
Int J Mol Sci. 2022 Aug 30;23(17):9857. doi: 10.3390/ijms23179857.
5
Alignment-Integrated Reconstruction of Ancestral Sequences Improves Accuracy.
Genome Biol Evol. 2020 Sep 1;12(9):1549-1565. doi: 10.1093/gbe/evaa164.
6
mTM-align: an algorithm for fast and accurate multiple protein structure alignment.
Bioinformatics. 2018 May 15;34(10):1719-1725. doi: 10.1093/bioinformatics/btx828.
7
Characterization of SPP inhibitors suppressing propagation of HCV and protozoa.
Proc Natl Acad Sci U S A. 2017 Dec 12;114(50):E10782-E10791. doi: 10.1073/pnas.1712484114. Epub 2017 Nov 29.
8
Discovery of novel non-competitive inhibitors of mammalian neutral M1 aminopeptidase (APN).
Biochimie. 2017 Nov;142:216-225. doi: 10.1016/j.biochi.2017.09.015. Epub 2017 Sep 28.
10
Comparative Protein Structure Modeling Using MODELLER.
Curr Protoc Bioinformatics. 2016 Jun 20;54:5.6.1-5.6.37. doi: 10.1002/cpbi.3.

References

1
SUPERFAMILY--sophisticated comparative genomics, data mining, visualization and phylogeny.
Nucleic Acids Res. 2009 Jan;37(Database issue):D380-6. doi: 10.1093/nar/gkn762. Epub 2008 Nov 26.
3
Using multiple templates to improve quality of homology models in automated homology modeling.
Protein Sci. 2008 Jun;17(6):990-1002. doi: 10.1110/ps.073344908. Epub 2008 Apr 25.
4
Matt: local flexibility aids protein multiple structure alignment.
PLoS Comput Biol. 2008 Jan;4(1):e10. doi: 10.1371/journal.pcbi.0040010.
5
Comparative protein structure modeling by combining multiple templates and optimizing sequence-to-structure alignments.
Bioinformatics. 2007 Oct 1;23(19):2558-65. doi: 10.1093/bioinformatics/btm377. Epub 2007 Sep 6.
6
DBAli tools: mining the protein structure space.
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W393-7. doi: 10.1093/nar/gkm236. Epub 2007 May 3.
7
MUSTANG: a multiple structural alignment algorithm.
Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921.
8
Variable gap penalty for protein sequence-structure alignment.
Protein Eng Des Sel. 2006 Mar;19(3):129-33. doi: 10.1093/protein/gzj005. Epub 2006 Jan 19.
9
A new progressive-iterative algorithm for multiple structure alignment.
Bioinformatics. 2005 Aug 1;21(15):3255-63. doi: 10.1093/bioinformatics/bti527. Epub 2005 Jun 7.
10
TM-align: a protein structure alignment algorithm based on the TM-score.
Nucleic Acids Res. 2005 Apr 22;33(7):2302-9. doi: 10.1093/nar/gki524. Print 2005.
