Henikoff S, Henikoff J G
Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, Washington 98104.
J Mol Biol. 1994 Nov 4;243(4):574-8. doi: 10.1016/0022-2836(94)90032-9.
Sequence weighting methods have been used to reduce redundancy and emphasize diversity in multiple sequence alignment and searching applications. Each of these methods is based on a notion of distance between a sequence and an ancestral or generalized sequence. We describe a different approach, which bases weights on the diversity observed at each position in the alignment, rather than on a sequence distance measure. These position-based weights make minimal assumptions, are simple to compute, and perform well in comprehensive evaluations.
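The abstract does not spell out the weighting formula, so the following is only a minimal sketch of the position-based scheme as it is commonly described: in each alignment column, a sequence receives 1/(k·n), where k is the number of distinct residues in that column and n is the number of sequences sharing its residue; the per-column contributions are summed and then normalized. The function name `position_based_weights`, the treatment of a gap character as an ordinary symbol, and the normalization to a sum of 1 are assumptions made for illustration, not details taken from the text above.

```python
from collections import Counter


def position_based_weights(alignment):
    """Return one weight per aligned sequence, normalized to sum to 1.

    Assumption: gap characters (e.g. '-') are counted like any other symbol.
    """
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")

    weights = [0.0] * len(alignment)
    for column in zip(*alignment):        # iterate over alignment positions
        counts = Counter(column)          # residue -> number of sequences with it
        k = len(counts)                   # number of distinct residues here
        for i, residue in enumerate(column):
            weights[i] += 1.0 / (k * counts[residue])

    total = sum(weights)                  # equals the alignment length
    return [w / total for w in weights]


# Two identical sequences share the weight that a single divergent one gets.
weights = position_based_weights(["AA", "AA", "CC"])
```

In this toy alignment the two redundant sequences each end up with half the weight of the divergent one, which is the redundancy-reducing behavior the abstract describes.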