School of Information and Electronic Engineering, Wuzhou University, Wuzhu, China.
College of Computer Science and Electronic Engineering, Hunan University, Hunan, China.
Sci Rep. 2018 May 15;8(1):7592. doi: 10.1038/s41598-018-26005-3.
One novel representation of DNA sequence combining the global and local position information of the original sequence has been proposed to distinguish the different species. First, for the sufficient exploitation of global information, one graphical representation of DNA sequence has been formulated according to the curve of Fermat spiral. Then, for the consideration of local characteristics of DNA sequence, attaching each point in the curve of Fermat spiral with the related mass has been applied based on the relationships of neighboring four nucleotides. In this paper, the normalized moments of inertia of the curve of Fermat spiral which composed by the points with mass has been calculated as the numerical description of the corresponding DNA sequence on the first exons of beta-global genes. Choosing the Euclidean distance as the measurement of the numerical descriptions, the similarity between species has shown the performance of proposed method.
已经提出了一种将 DNA 序列的全局和局部位置信息结合在一起的新表示方法,以区分不同的物种。首先,为了充分利用全局信息,根据费马螺线的曲线,制定了一种 DNA 序列的图形表示方法。然后,为了考虑 DNA 序列的局部特征,根据相邻四个核苷酸的关系,将费马螺线曲线上的每个点与相关质量附加在一起。在本文中,计算了由带质量的点组成的费马螺线的归一化惯性矩,作为β-全局基因第一个外显子上相应 DNA 序列的数值描述。选择欧几里得距离作为数值描述的度量,物种间的相似性显示了所提出方法的性能。