Brief Bioinform. 2019 Jul 19;20(4):1114-1124. doi: 10.1093/bib/bbx174.
DNA replication begins at replication origins in all three domains of life. Identification and characterization of replication origins are important not only in providing insights into the structure and function of the replication origins but also in understanding the regulatory mechanisms of the initiation step in DNA replication. The Z-curve method has been used in the identification of replication origins in archaeal genomes successfully since 2002. Furthermore, the Web servers of Ori-Finder and Ori-Finder 2 have been developed to predict replication origins in both bacterial and archaeal genomes based on the Z-curve method, and the replication origins with manual curation have been collected into an online database, DoriC. Ori-Finder system and DoriC database are currently used in the research field of DNA replication origins in prokaryotes, including: (i) identification of oriC regions in bacterial and archaeal genomes; (ii) discovery and analysis of the conserved sequences within oriC regions; and (iii) strand-biased analysis of bacterial genomes. Up to now, more and more predicted results by Ori-Finder system were supported by subsequent experiments, and Ori-Finder system has been used to identify the replication origins in > 100 newly sequenced prokaryotes in their genome reports. In addition, the data in DoriC database have been widely used in the large-scale analyses of replication origins and strand bias in prokaryotic genomes. Here, we review the development of Ori-Finder system and DoriC database as well as their applications. Some future directions and aspects for extending the application of Ori-Finder and DoriC are also presented.
DNA 复制始于所有三个生命领域的复制原点。识别和表征复制原点不仅对于深入了解复制原点的结构和功能很重要,而且对于理解 DNA 复制起始步骤的调控机制也很重要。自 2002 年以来,Z 曲线法已成功用于古菌基因组中复制原点的识别。此外,还开发了 Ori-Finder 和 Ori-Finder 2 的网络服务器,以便基于 Z 曲线法预测细菌和古菌基因组中的复制原点,并且已经将经过人工整理的复制原点收集到一个在线数据库 DoriC 中。Ori-Finder 系统和 DoriC 数据库目前用于原核生物 DNA 复制原点的研究领域,包括:(i)在细菌和古菌基因组中鉴定 oriC 区域;(ii)发现和分析 oriC 区域内的保守序列;以及(iii)细菌基因组的链偏析分析。到目前为止,越来越多的 Ori-Finder 系统预测结果得到了后续实验的支持,并且 Ori-Finder 系统已用于在其基因组报告中识别超过 100 个新测序的原核生物的复制原点。此外,DoriC 数据库中的数据已广泛用于原核生物基因组中复制原点和链偏析的大规模分析。在这里,我们回顾了 Ori-Finder 系统和 DoriC 数据库的发展以及它们的应用。还提出了扩展 Ori-Finder 和 DoriC 应用的一些未来方向和方面。