Nini Rao, Lijun Qiu
School of Life Science & Technology, University of Electronic Science and Technology of China, Chengdu 610054, China.
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2005 Aug;22(4):681-5.
In the analysis of biomolecular sequences by mathematical, physical, and digital signal processing methods, the first problem to be solved is how to map DNA sequences onto numerical sequences. This paper analyzes the characteristics and applicability of eight existing mapping methods and presents a new numerical mapping method based on the probability of each base within a DNA sequence segment. Most coding sequences are characterized by 3-base periodicity. The eight existing numerical mapping methods are therefore compared, and the new method is verified, by spectrum analysis of DNA sequences with 3-base periodicity. The computer simulation results show that the mapping method based on the complex plane is superior to the other seven existing methods, both in reflecting the original information of the biomolecular sequence and in the quality of the power spectra obtained. The new method attains an identification rate close to that of the complex-plane method.