Zhang Min, Fang Weiwu, Zhang Junhua, Chi Zhongxian
Department of Computer Science and Engineering, Dalian University of Technology, Dalian 116024, China.
Comput Biol Chem. 2005 Apr;29(2):175-81. doi: 10.1016/j.compbiolchem.2004.12.005.
We propose an algorithm of global multiple sequence alignment that is based on a measure of what we call information discrepancy. The algorithm follows a progressive alignment iteration strategy that makes use of what we call a function of degree of disagreement (FDOD). MSAID begins with distance calculation of pairwise sequences, based on FDOD as a numerical scoring measure. In the next step, the resulting distance matrix is used to construct a guide tree via the neighbor-joining method. The tree is then used to produce a multiple alignment. Current alignment is next used to produce a new matrix and a new tree (with FDOD scoring measure again). This iterative process continues until convergence criteria (or a stopping rule) are satisfied. MSAID was tested and compared with other prior methods by using reference alignments from BAliBASE 2.01. For the alignments with no large N/C-terminal extensions or internal insertions MSAID received the top overall average in the tests. Moreover, the results of testing indicate that MSAID performs as well as other alignment methods with an occasional tendency to perform better than these prior techniques. We, therefore, believe that MSAID is a solid and reliable method of choice, which is often (if not always) superior to other global alignment techniques.
我们提出了一种基于我们所谓的信息差异度量的全局多序列比对算法。该算法遵循一种渐进比对迭代策略,利用我们所谓的不一致度函数(FDOD)。MSAID首先基于FDOD作为数值评分度量来计算成对序列的距离。在下一步中,所得的距离矩阵通过邻接法用于构建引导树。然后使用该树生成多序列比对。接下来,当前的比对用于生成新的矩阵和新的树(再次使用FDOD评分度量)。这个迭代过程持续进行,直到满足收敛标准(或停止规则)。通过使用来自BAliBASE 2.01的参考比对,对MSAID进行了测试并与其他先前方法进行了比较。对于没有大的N/C末端延伸或内部插入的比对,MSAID在测试中获得了最高的总体平均分。此外,测试结果表明,MSAID的性能与其他比对方法相当,偶尔还有比这些先前技术表现更好的趋势。因此,我们认为MSAID是一种可靠且值得选择的方法,它通常(如果不是总是)优于其他全局比对技术。