School of Information Science, Suihua University, Suihua 152061, China; Department of Mathematics, The Chinese University of Hong Kong, Shatin 999077, Hong Kong.
Department of Mathematics, The Chinese University of Hong Kong, Shatin 999077, Hong Kong.
Gene. 2014 Aug 1;546(1):25-34. doi: 10.1016/j.gene.2014.05.043. Epub 2014 May 22.
Based on the well-known k-mer model, we propose a k-mer natural vector model that represents a genetic sequence by the numbers and distributions of k-mers in the sequence. We show that there is a one-to-one correspondence between a genetic sequence and its associated k-mer natural vector. The k-mer natural vector method can be used easily and quickly to perform phylogenetic analysis of genetic sequences without requiring evolutionary models or human intervention. Whole or partial genomes can be handled more effectively with our proposed method. We apply it to the phylogenetic analysis of genetic sequences, and the results obtained demonstrate that the k-mer natural vector method is a powerful tool for analysing and annotating genetic sequences and for determining evolutionary relationships, in terms of both accuracy and efficiency.
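The abstract does not give the formulas, so the following is only a minimal sketch of what a vector built from "the numbers and distributions of k-mers" might look like. It assumes that each k-mer contributes three components: its count, its mean starting position, and a normalized second central moment of its positions. The normalization by count times number of windows, the 1-based positions, and the function names `kmer_natural_vector` and `euclidean` are illustrative assumptions, not the authors' definitions. Because the sketch stops at the second moment, it should not be expected to reproduce the one-to-one correspondence stated above.

```python
from itertools import product
from math import sqrt


def kmer_natural_vector(seq, k, alphabet="ACGT"):
    """Return [counts..., mean positions..., second moments...] over all k-mers.

    Assumed layout and normalization (not taken from the paper).
    """
    seq = seq.upper()
    total = len(seq) - k + 1  # number of k-mer windows in the sequence
    if k < 1 or total < 1:
        raise ValueError("sequence must be at least k characters long")

    # Record the 1-based start positions of every k-mer over the alphabet.
    positions = {"".join(p): [] for p in product(alphabet, repeat=k)}
    for i in range(total):
        kmer = seq[i:i + k]
        if kmer in positions:  # windows with ambiguous symbols (e.g. N) are skipped
            positions[kmer].append(i + 1)

    counts, means, moments = [], [], []
    for kmer in sorted(positions):
        pos = positions[kmer]
        n = len(pos)
        mu = sum(pos) / n if n else 0.0
        # Second central moment, scaled by n * total (an assumed normalization).
        d2 = sum((p - mu) ** 2 for p in pos) / (n * total) if n else 0.0
        counts.append(n)
        means.append(mu)
        moments.append(d2)
    return counts + means + moments


def euclidean(u, v):
    """Euclidean distance between two vectors of equal length."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
```

Under these assumptions, pairwise `euclidean` distances between the vectors of several sequences could be passed to any standard distance-based tree-building method, which is one way to carry out an analysis that needs no evolutionary model or alignment.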