Mirzaei Soraya, Razmara Jafar, Lotfi Shahriar
Department of Computer Science, Faculty of Mathematics, Statistics, and Computer Science, University of Tabriz, Tabriz, Iran.
Bioimpacts. 2021;11(4):271-279. doi: 10.34172/bi.2021.37. Epub 2020 Jul 8.
Similarity analysis of protein structure is considered as a fundamental step to give insight into the relationships between proteins. The primary step in structural alignment is looking for the optimal correspondence between residues of two structures to optimize the scoring function. An exhaustive search for finding such a correspondence between two structures is intractable. In this paper, a hybrid method is proposed, namely GADP-align, for pairwise protein structure alignment. The proposed method looks for an optimal alignment using a hybrid method based on a genetic algorithm and an iterative dynamic programming technique. To this end, the method first creates an initial map of correspondence between secondary structure elements (SSEs) of two proteins. Then, a genetic algorithm combined with an iterative dynamic programming algorithm is employed to optimize the alignment. The GADP-align algorithm was employed to align 10 'difficult to align' protein pairs in order to evaluate its performance. The experimental study shows that the proposed hybrid method produces highly accurate alignments in comparison with the methods using exactly the dynamic programming technique. Furthermore, the proposed method prevents the local optimal traps caused by the unsuitable initial guess of the corresponding residues. The findings of this paper demonstrate that employing the genetic algorithm along with the dynamic programming technique yields highly accurate alignments between a protein pair by exploring the global alignment and avoiding trapping in local alignments.
蛋白质结构的相似性分析被视为深入了解蛋白质之间关系的基本步骤。结构比对的首要步骤是寻找两个结构残基之间的最优对应关系,以优化评分函数。对两个结构之间寻找这种对应关系进行穷举搜索是难以处理的。本文提出了一种用于蛋白质结构两两比对的混合方法,即GADP-align。该方法基于遗传算法和迭代动态规划技术,通过混合方法寻找最优比对。为此,该方法首先创建两个蛋白质二级结构元件(SSE)之间的初始对应图谱。然后,采用遗传算法结合迭代动态规划算法来优化比对。为评估GADP-align算法的性能,使用该算法对10对“难以比对”的蛋白质进行了比对。实验研究表明,与仅使用动态规划技术的方法相比,所提出的混合方法能产生高度准确的比对结果。此外,该方法避免了因对应残基初始猜测不合适而导致的局部最优陷阱。本文的研究结果表明,遗传算法与动态规划技术相结合,通过探索全局比对并避免陷入局部比对,能够在蛋白质对之间产生高度准确的比对结果。