Sankoff D
Proc Natl Acad Sci U S A. 1972 Jan;69(1):4-6. doi: 10.1073/pnas.69.1.4.
Given two finite sequences, we wish to find the longest common subsequences satisfying certain deletion/insertion constraints. Consider two successive terms in the desired subsequence. The distance between their positions must be the same in the two original sequences for all but a limited number of such pairs of successive terms. Needleman and Wunsch gave an algorithm for finding longest common subsequences without constraints. This is improved from the viewpoint of computational economy. An economical algorithm is then elaborated for finding subsequences satisfying deletion/insertion constraints. This result is useful in the study of genetic homology based on nucleotide or amino-acid sequences.
给定两个有限序列,我们希望找到满足特定删除/插入约束的最长公共子序列。考虑所需子序列中的两个连续项。对于除了有限数量的此类连续项对之外的所有项,它们在两个原始序列中的位置之间的距离必须相同。Needleman和Wunsch给出了一种用于找到无约束最长公共子序列的算法。从计算经济性的角度对其进行了改进。然后精心设计了一种经济算法来找到满足删除/插入约束的子序列。该结果在基于核苷酸或氨基酸序列的遗传同源性研究中很有用。