蛋白质的多重结构比对和共识识别。

Multiple structure alignment and consensus identification for proteins.

机构信息

Department of Computer Science, Gettysburg College, Gettysburg, PA, USA.

出版信息

BMC Bioinformatics. 2010 Feb 2;11:71. doi: 10.1186/1471-2105-11-71.

DOI:10.1186/1471-2105-11-71

PMID:20122279

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2829528/

Abstract

BACKGROUND

An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each protein as a sequence of triples of coordinates of the alpha-carbon atoms along the backbone. It then computes iteratively a sequence of transformation matrices (i.e., translations and rotations) to align the proteins in space and generate the consensus. The algorithm is a heuristic in that it computes an approximation to the optimal alignment that minimizes the sum of the pairwise distances between the consensus and the transformed proteins.

RESULTS

Experimental results show that the algorithm converges quite rapidly and generates consensus structures that are visually similar to the input proteins. A comparison with other coordinate-based alignment algorithms (MAMMOTH and MATT) shows that the proposed algorithm is competitive in terms of speed and the sizes of the conserved regions discovered in an extensive benchmark dataset derived from the HOMSTRAD and SABmark databases. The algorithm has been implemented in C++ and can be downloaded from the project's web page. Alternatively, the algorithm can be used via a web server which makes it possible to align protein structures by uploading files from local disk or by downloading protein data from the RCSB Protein Data Bank.

CONCLUSIONS

An algorithm is presented to compute a multiple structure alignment for a set of proteins, together with their consensus structure. Experimental results show its effectiveness in terms of the quality of the alignment and computational cost.

摘要

背景

本文提出了一种算法，用于计算一组蛋白质的多重结构比对，并生成一个共识（伪）蛋白质，该蛋白质捕捉到给定蛋白质中存在的常见子结构。该算法将每个蛋白质表示为沿骨架的α-碳原子坐标的三重序列。然后，它迭代计算一系列变换矩阵（即平移和旋转），以在空间中对齐蛋白质并生成共识。该算法是一种启发式算法，它计算出最佳对齐的近似值，该值最小化共识与变换后的蛋白质之间的成对距离的总和。

结果

实验结果表明，该算法收敛速度相当快，生成的共识结构与输入蛋白质在视觉上相似。与其他基于坐标的对齐算法（MAMMOTH 和 MATT）的比较表明，该算法在速度和在从 HOMSTRAD 和 SABmark 数据库派生的广泛基准数据集发现的保守区域的大小方面具有竞争力。该算法已用 C++实现，并可从项目网页下载。或者，可以通过一个 Web 服务器使用该算法，该服务器允许通过从本地磁盘上传文件或从 RCSB 蛋白质数据库下载蛋白质数据来对齐蛋白质结构。

结论

本文提出了一种算法，用于计算一组蛋白质及其共识结构的多重结构比对。实验结果表明，该算法在对齐质量和计算成本方面都具有有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4e2/2829528/bcdf39303c68/1471-2105-11-71-1.jpg

相似文献

Multiple structure alignment and consensus identification for proteins.蛋白质的多重结构比对和共识识别。

BMC Bioinformatics. 2010 Feb 2;11:71. doi: 10.1186/1471-2105-11-71.

Approximate multiple protein structure alignment using the sum-of-pairs distance.使用对和距离进行近似多蛋白质结构比对。

J Comput Biol. 2004;11(5):986-1000. doi: 10.1089/cmb.2004.11.986.

RCSB protein Data Bank: exploring protein 3D similarities via comprehensive structural alignments.RCSB 蛋白质数据库：通过全面的结构比对探索蛋白质 3D 相似性。

Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae370.

mTM-align: an algorithm for fast and accurate multiple protein structure alignment.mTM-align：一种快速准确的多蛋白质结构比对算法。

Bioinformatics. 2018 May 15;34(10):1719-1725. doi: 10.1093/bioinformatics/btx828.

Matt: local flexibility aids protein multiple structure alignment.马特：局部灵活性有助于蛋白质多结构比对。

PLoS Comput Biol. 2008 Jan;4(1):e10. doi: 10.1371/journal.pcbi.0040010.

Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score.Fr-TM-align：一种基于片段比对和TM分数的新型蛋白质结构比对方法。

BMC Bioinformatics. 2008 Dec 12;9:531. doi: 10.1186/1471-2105-9-531.

Multiple structure alignment with msTALI.多结构比对与 msTALI。

BMC Bioinformatics. 2012 May 20;13:105. doi: 10.1186/1471-2105-13-105.

Pairwise protein structure alignment based on an orientation-independent backbone representation.基于与方向无关的主链表示的成对蛋白质结构比对。

J Bioinform Comput Biol. 2004 Dec;2(4):699-717. doi: 10.1142/s021972000400082x.

mTM-align: a server for fast protein structure database search and multiple protein structure alignment.mTM-align：一个用于快速蛋白质结构数据库搜索和多蛋白质结构比对的服务器。

Nucleic Acids Res. 2018 Jul 2;46(W1):W380-W386. doi: 10.1093/nar/gky430.

DALIX: optimal DALI protein structure alignment.DALIX：最佳 DALI 蛋白结构比对。

IEEE/ACM Trans Comput Biol Bioinform. 2013 Jan-Feb;10(1):26-36. doi: 10.1109/TCBB.2012.143.

引用本文的文献

In silico analysis of class I adenylate-forming enzymes reveals family and group-specific conservations.计算机分析 I 类腺苷酸形成酶揭示了家族和组特异性的保守性。

PLoS One. 2018 Sep 4;13(9):e0203218. doi: 10.1371/journal.pone.0203218. eCollection 2018.

mTM-align: an algorithm for fast and accurate multiple protein structure alignment.mTM-align：一种快速准确的多蛋白质结构比对算法。

Bioinformatics. 2018 May 15;34(10):1719-1725. doi: 10.1093/bioinformatics/btx828.

analysis of heme oxygenase structural homologues identifies group-specific conservations.对血红素加氧酶结构同源物的分析确定了特定组的保守性。

FEBS Open Bio. 2017 Sep 4;7(10):1480-1498. doi: 10.1002/2211-5463.12275. eCollection 2017 Oct.

Structure and function of α-glucan debranching enzymes.α-葡聚糖脱支酶的结构与功能

Cell Mol Life Sci. 2016 Jul;73(14):2619-41. doi: 10.1007/s00018-016-2241-y. Epub 2016 May 2.

Crystal structure of HINT from Helicobacter pylori.幽门螺杆菌HINT的晶体结构。

Acta Crystallogr F Struct Biol Commun. 2016 Jan;72(Pt 1):42-8. doi: 10.1107/S2053230X15023316. Epub 2016 Jan 1.

Plasmodium falciparum infection induces expression of a mosquito salivary protein (Agaphelin) that targets neutrophil function and inhibits thrombosis without impairing hemostasis.恶性疟原虫感染会诱导一种蚊子唾液蛋白（Agaphelin）的表达，该蛋白靶向中性粒细胞功能并抑制血栓形成，同时不损害止血功能。

PLoS Pathog. 2014 Sep 11;10(9):e1004338. doi: 10.1371/journal.ppat.1004338. eCollection 2014 Sep.

Identifying RNA-binding residues based on evolutionary conserved structural and energetic features.基于进化保守的结构和能量特征鉴定 RNA 结合残基。

Nucleic Acids Res. 2014 Feb;42(3):e15. doi: 10.1093/nar/gkt1299. Epub 2013 Dec 16.

Alignment of distantly related protein structures: algorithm, bound and implications to homology modeling.远缘蛋白质结构的比对：算法、界限及其同源建模的意义。

Bioinformatics. 2011 Sep 15;27(18):2537-45. doi: 10.1093/bioinformatics/btr432. Epub 2011 Jul 26.

The PE-PPE domain in mycobacterium reveals a serine α/β hydrolase fold and function: an in-silico analysis.分枝杆菌的 PE-PPE 结构域揭示了丝氨酸 α/β 水解酶折叠和功能：计算机分析。

PLoS One. 2011 Feb 10;6(2):e16745. doi: 10.1371/journal.pone.0016745.

本文引用的文献

Alignment of multiple protein structures based on sequence and structure features.基于序列和结构特征对多个蛋白质结构进行比对。

Protein Eng Des Sel. 2009 Sep;22(9):569-74. doi: 10.1093/protein/gzp040. Epub 2009 Jul 8.

Jalview Version 2--a multiple sequence alignment editor and analysis workbench.Jalview 2版本——一个多序列比对编辑器和分析工作台。

Bioinformatics. 2009 May 1;25(9):1189-91. doi: 10.1093/bioinformatics/btp033. Epub 2009 Jan 16.

Matt: local flexibility aids protein multiple structure alignment.马特：局部灵活性有助于蛋白质多结构比对。

PLoS Comput Biol. 2008 Jan;4(1):e10. doi: 10.1371/journal.pcbi.0040010.

A new progressive-iterative algorithm for multiple structure alignment.一种用于多结构比对的新型渐进迭代算法。

Bioinformatics. 2005 Aug 1;21(15):3255-63. doi: 10.1093/bioinformatics/bti527. Epub 2005 Jun 7.

Approximate multiple protein structure alignment using the sum-of-pairs distance.使用对和距离进行近似多蛋白质结构比对。

J Comput Biol. 2004;11(5):986-1000. doi: 10.1089/cmb.2004.11.986.

Pairwise protein structure alignment based on an orientation-independent backbone representation.基于与方向无关的主链表示的成对蛋白质结构比对。

J Bioinform Comput Biol. 2004 Dec;2(4):699-717. doi: 10.1142/s021972000400082x.

SABmark--a benchmark for sequence alignment that covers the entire known fold space.SABmark——一种涵盖整个已知折叠空间的序列比对基准。

Bioinformatics. 2005 Apr 1;21(7):1267-8. doi: 10.1093/bioinformatics/bth493. Epub 2004 Aug 27.

Comparison of performance in successive CASP experiments.连续几届蛋白质结构预测关键评估（CASP）实验中的性能比较。

Proteins. 2001;Suppl 5:163-70. doi: 10.1002/prot.10053.

A new algorithm for the alignment of multiple protein structures using Monte Carlo optimization.一种使用蒙特卡罗优化算法对多个蛋白质结构进行比对的新算法。

Pac Symp Biocomput. 2001:275-86. doi: 10.1142/9789814447362_0028.

The Protein Data Bank.蛋白质数据库。

Nucleic Acids Res. 2000 Jan 1;28(1):235-42. doi: 10.1093/nar/28.1.235.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

蛋白质的多重结构比对和共识识别。

Multiple structure alignment and consensus identification for proteins.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献