Vingron M, Argos P
European Molecular Biology Laboratory, Heidelberg, Germany.
J Mol Biol. 1991 Mar 5;218(1):33-43. doi: 10.1016/0022-2836(91)90871-3.
Calculation of dot-matrices is a widespread tool in the search for sequence similarities. When sequences are distant, even this approach may fail to point out common regions. If several plots calculated for all members of a sequence set consistently displayed a similarity between them, this would increase its credibility. We present an algorithm to delineate dot-plot agreement. A novel procedure based on matrix multiplication is developed to identify common patterns and reliably aligned regions in a set of distantly related sequences. The algorithm finds motifs independent of input sequence lengths and reduces the dependence on gap penalties. When sequences share greater similarity, the same approach converts to a multiple sequence alignment procedure.
点阵计算是寻找序列相似性时广泛使用的工具。当序列差异较大时,即使这种方法也可能无法指出共同区域。如果针对序列集的所有成员计算的多个图谱一致显示出它们之间的相似性,这将增加其可信度。我们提出了一种描绘点阵一致性的算法。开发了一种基于矩阵乘法的新程序,以识别一组远缘相关序列中的共同模式和可靠比对区域。该算法能够找到与输入序列长度无关的基序,并减少对空位罚分的依赖。当序列具有更高的相似性时,相同的方法可转换为多序列比对程序。