Badr Ghada H, Al-Aqel Haifa A
IRI- The City of Scientific Research and Technological Applications, University and Research District, P. O. 21934, New Borg Alarab, Alexandria, Egypt.
University of Ottawa, Faculty of Engineering, Ottawa, Canada.
BMC Bioinformatics. 2017 Oct 16;18(Suppl 12):427. doi: 10.1186/s12859-017-1830-6.
Genome rearrangements are essential processes for evolution and are responsible for existing varieties of genome architectures. Many studies have been conducted to obtain an algorithm that identifies the minimum number of inversions that are necessary to transform one genome into another; this allows for genome sequence representation in polynomial time. Studies have not been conducted on the topic of rearranging a genome when it is represented as a secondary structure. Unlike sequences, the secondary structure preserves the functionality of the genome. Sequences can be different, but they all share the same structure and, therefore, the same functionality.
This paper proposes a double cut and join for RNA secondary structures (DCJ-RNA) algorithm. This algorithm allows for the description of evolutionary scenarios that are based on secondary structures rather than sequences. The main aim of this paper is to suggest an efficient algorithm that can help researchers compare two ribonucleic acid (RNA) secondary structures based on rearrangement operations. The results, which are based on real datasets, show that the algorithm is able to count the minimum number of rearrangement operations, as well as to report an optimum scenario that can increase the similarity between the two structures.
The algorithm calculates the distance between structures and reports a scenario based on the minimum rearrangement operations required to make the given structure similar to the other. DCJ-RNA can also be used to measure the distance between the two structures. This can help identify the common functionalities between different species.
基因组重排是进化的重要过程,决定了现有各种基因组结构。已经开展了许多研究来获得一种算法,该算法能够确定将一个基因组转化为另一个基因组所需的最小倒位数;这使得基因组序列能够在多项式时间内表示。然而,当基因组表示为二级结构时,尚未有关于重排基因组的研究。与序列不同,二级结构保留了基因组的功能。序列可能不同,但它们都具有相同的结构,因此具有相同的功能。
本文提出了一种针对RNA二级结构的双切割与连接(DCJ-RNA)算法。该算法能够描述基于二级结构而非序列的进化场景。本文的主要目的是提出一种高效算法,帮助研究人员基于重排操作比较两个核糖核酸(RNA)二级结构。基于真实数据集的结果表明,该算法能够计算重排操作的最小数量,并报告一个能够增加两个结构之间相似性的最优场景。
该算法计算结构之间的距离,并根据使给定结构与另一个结构相似所需的最小重排操作报告一个场景。DCJ-RNA还可用于测量两个结构之间的距离。这有助于识别不同物种之间的共同功能。