Nair Achuthsankar S, Mahalakshmi T
Department of Computer Science, University of Kerala, India - 695 581.
In Silico Biol. 2006;6(3):215-22.
This paper reports a novel symbol-to-signal mapping for DNA sequences, based on the concept of categorical periodograms. A categorical periodogram is a numeric sequence with the n-th element of the sequence indicating the number of occurrences of cycles with period n in it. The period of the cycle is defined as the number of intervening events plus one. Spectral analysis studies have been conducted on Cumulative Categorical Periodogram (CCP) of 10 genes from the data set of Burset and Guigo. It is observed that the spectral signatures in CCP are functionally equivalent to the established N/3 peak in the spectrum of indicator sequences of genomes. Being a single sequence compared to four sequences in the case of indicator sequence representation, the method is claimed to be functionally equivalent, but computationally better for identification of gene coding regions in sequences.
本文报道了一种基于分类周期图概念的新型DNA序列符号到信号映射。分类周期图是一个数字序列,该序列的第n个元素表示其中周期为n的循环出现的次数。循环的周期定义为中间事件的数量加1。对来自Burset和Guigo数据集的10个基因的累积分类周期图(CCP)进行了频谱分析研究。据观察,CCP中的频谱特征在功能上等同于基因组指示序列频谱中已确立的N/3峰值。与指示序列表示中的四个序列相比,该方法是单个序列,据称在功能上等效,但在计算上更有利于识别序列中的基因编码区域。