Center for Molecular Medicine and Genetics, Wayne State University Medical School, Detroit, MI 48201, USA;
Department of Physics, Tianjin University, Tianjin 300072, China.
Curr Genomics. 2014 Apr;15(2):78-94. doi: 10.2174/1389202915999140328162433.
In theoretical physics, there exist two basic mathematical approaches, algebraic and geometrical methods, which, in most cases, are complementary. In the area of genome sequence analysis, however, algebraic approaches have been widely used, while geometrical approaches have been less explored for a long time. The Z-curve theory is a geometrical approach to genome analysis. The Z-curve is a three-dimensional curve that represents a given DNA sequence in the sense that each can be uniquely reconstructed given the other. The Z-curve, therefore, contains all the information that the corresponding DNA sequence carries. The analysis of a DNA sequence can then be performed through studying the corresponding Z-curve. The Z-curve method has found applications in a wide range of areas in the past two decades, including the identifications of protein-coding genes, replication origins, horizontally-transferred genomic islands, promoters, translational start sides and isochores, as well as studies on phylogenetics, genome visualization and comparative genomics. Here, we review the progress of Z-curve studies from aspects of both theory and applications in genome analysis.
在理论物理中,存在两种基本的数学方法,代数方法和几何方法,它们在大多数情况下是互补的。然而,在基因组序列分析领域,代数方法得到了广泛的应用,而几何方法长期以来一直没有得到充分的探索。Z 曲线理论是一种用于基因组分析的几何方法。Z 曲线是一种三维曲线,它表示给定的 DNA 序列,因为给定其中一个就可以唯一地重建另一个。因此,Z 曲线包含了相应 DNA 序列所携带的所有信息。然后可以通过研究相应的 Z 曲线来分析 DNA 序列。在过去的二十年中,Z 曲线方法已经在广泛的领域得到了应用,包括鉴定蛋白质编码基因、复制起点、水平转移基因组岛、启动子、翻译起始侧和同调区,以及在系统发育学、基因组可视化和比较基因组学方面的研究。在这里,我们从理论和应用两个方面综述了 Z 曲线在基因组分析中的研究进展。