Song Jie, Tang Huanwen
Institute of Computational Biology and Bioinformatics, Dalian University of Technology, Dalian 116024, People's Republic of China.
J Biochem Biophys Methods. 2005 Jun 30;63(3):228-39. doi: 10.1016/j.jbbm.2005.04.004.
We consider a novel 2-D graphical representation of DNA sequences according to chemical structures of bases, reflecting distribution of bases with different chemical structure, preserving information on sequential adjacency of bases, and allowing numerical characterization. The representation avoids loss of information accompanying alternative 2-D representations in which the curve standing for DNA overlaps and intersects itself. Based on this representation we present a numerical characterization approach by the leading eigenvalues of the matrices associated with the DNA sequences. The utility of the approach is illustrated on the coding sequences of the first exon of human beta-globin gene.
我们考虑一种根据碱基化学结构对DNA序列进行的新型二维图形表示法,它反映了具有不同化学结构的碱基分布,保留了碱基序列邻接信息,并允许进行数值表征。这种表示法避免了伴随其他二维表示法出现的信息丢失,在其他二维表示法中,代表DNA的曲线会相互重叠和交叉。基于这种表示法,我们提出了一种通过与DNA序列相关矩阵的主特征值进行数值表征的方法。该方法的实用性在人类β-珠蛋白基因第一个外显子的编码序列上得到了说明。