Department of Mathematics, Luoyang Normal University, Luoyang 471022, PR China.
Math Biosci. 2010 Oct;227(2):147-52. doi: 10.1016/j.mbs.2010.07.004. Epub 2010 Aug 3.
We consider to construct 4(L)-components vectors for a DNA primary sequence based on the L-tuple. For two DNA sequences, using the corresponding vectors, we construct a set of L x L matrices called related matrix. The mathematical characterization from the constructed matrices have been selected to characterize the degree of similarity between the two DNA sequences. The search for similar sequences of a query sequence from a database of 39 library sequences and the construction of phylogenetic tree of H5N1 avian influenza virus illustrate the utility of the matrices for DNA sequences.
我们考虑基于 L-元组为 DNA 一级序列构造 4(L)-分量向量。对于两个 DNA 序列,使用相应的向量,我们构造了一组 L x L 的矩阵,称为相关矩阵。从构造的矩阵中进行数学特征选择,以表征两个 DNA 序列之间的相似程度。从 39 个库序列的数据库中查询序列的相似序列搜索以及 H5N1 禽流感病毒的系统发育树的构建,说明了这些矩阵在 DNA 序列中的实用性。