Mac Dónaill D A
Department of Chemistry, Trinity College, University of Dublin, Ireland.
Comput Appl Biosci. 1995 Oct;11(5):567-9.
A method that exploits quaternary integer arithmetic for the rapid comparison of nucleotide sequences is proposed. Its central feature is the identification of a suitable integer arithmetic operation which, applied to two numerical strings (each representing a sequence), yields an output string unambiguously related to the number of matching nucleotides in the compared sequences. The number of matching bases can therefore be determined without individual base-by-base comparisons. The method is a general extension of the algorithm previously described in the code UNIREP. The rules governing integer arithmetic as applied to nucleotide sequences are discussed.