Gupta S K, Kececioglu J D, Schäffer A A
Department of Computer Science, Rice University, Houston, TX 77005-1892, USA.
J Comput Biol. 1995 Fall;2(3):459-72. doi: 10.1089/cmb.1995.2.459.
The MSA program, written and distributed in 1989, is one of the few existing programs that attempts to find optimal alignments of multiple protein or DNA sequences. The MSA program implements a branch-and-bound technique together with a variant of Dijkstra's shortest paths algorithm to prune the basic dynamic programming graph. We have made substantial improvements in the time and space usage of MSA. The improvements make feasible a variety of problem instances that were not feasible previously. On some runs we achieve an order of magnitude reduction in space usage and a significant multiplicative factor speedup in running time. To explain how these improvements work, we give a much more detailed description of MSA than has been previously available. In practice, MSA rarely produces a provably optimal alignment and we explain why.
MSA程序于1989年编写并发布,是少数几个试图找到多个蛋白质或DNA序列的最佳比对的现有程序之一。MSA程序采用分支定界技术并结合迪杰斯特拉最短路径算法的一个变体来修剪基本动态规划图。我们在MSA的时间和空间使用方面取得了显著改进。这些改进使以前不可行的各种问题实例变得可行。在某些运行中,我们实现了空间使用减少一个数量级,运行时间显著加快。为了解释这些改进是如何工作的,我们对MSA进行了比以前更详细的描述。在实践中,MSA很少能产生可证明的最优比对,我们对此进行了解释。