Konagurthu Arun S, Whisstock James C, Stuckey Peter J, Lesk Arthur M
Department of Computer Science and Software Engineering, The University of Melbourne, Parkville, Melbourne, Victoria, 3010 Australia.
Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921.
Multiple structural alignment is a fundamental problem in structural genomics. In this article, we define a reliable and robust algorithm, MUSTANG (MUltiple STructural AligNment AlGorithm), for the alignment of multiple protein structures. Given a set of protein structures, the program constructs a multiple alignment using the spatial information of the C(alpha) atoms in the set. Broadly based on the progressive pairwise heuristic, this algorithm gains accuracy through novel and effective refinement phases. MUSTANG reports the multiple sequence alignment and the corresponding superposition of structures. Alignments generated by MUSTANG are compared with several handcurated alignments in the literature as well as with the benchmark alignments of 1033 alignment families from the HOMSTRAD database. The performance of MUSTANG was compared with DALI at a pairwise level, and with other multiple structural alignment tools such as POSA, CE-MC, MALECON, and MultiProt. MUSTANG performs comparably to popular pairwise and multiple structural alignment tools for closely related proteins, and performs more reliably than other multiple structural alignment methods on hard data sets containing distantly related proteins or proteins that show conformational changes.
多结构比对是结构基因组学中的一个基本问题。在本文中,我们定义了一种可靠且强大的算法——MUSTANG(多结构比对算法),用于对多个蛋白质结构进行比对。给定一组蛋白质结构,该程序利用集合中Cα原子的空间信息构建多序列比对。该算法大致基于渐进式成对启发式方法,通过新颖且有效的优化阶段提高准确性。MUSTANG会报告多序列比对以及相应的结构叠加。将MUSTANG生成的比对结果与文献中的几个手工整理的比对结果以及来自HOMSTRAD数据库的1033个比对家族的基准比对结果进行比较。在成对水平上,将MUSTANG的性能与DALI进行比较,并与其他多结构比对工具(如POSA、CE - MC、MALECON和MultiProt)进行比较。对于密切相关的蛋白质,MUSTANG的性能与流行的成对和多结构比对工具相当,而在包含远缘相关蛋白质或显示构象变化的蛋白质的困难数据集上,MUSTANG的表现比其他多结构比对方法更可靠。