Correa Leonardo, Borguesan Bruno, Farfan Camilo, Inostroza-Ponta Mario, Dorn Marcio
IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):690-704. doi: 10.1109/TCBB.2016.2635143. Epub 2016 Dec 2.
Memetic Algorithms are population-based metaheuristics intrinsically concerned with exploiting all available knowledge about the problem under study. The incorporation of problem domain knowledge is not an optional mechanism, but a fundamental feature of the Memetic Algorithms. In this paper, we present a Memetic Algorithm to tackle the three-dimensional protein structure prediction problem. The method uses a structured population and incorporates a Simulated Annealing algorithm as a local search strategy, as well as ad-hoc crossover and mutation operators to deal with the problem. It takes advantage of structural knowledge stored in the Protein Data Bank, by using an Angle Probability List that helps to reduce the search space and to guide the search strategy. The proposed algorithm was tested on nineteen protein sequences of amino acid residues, and the results show the ability of the algorithm to find native-like protein structures. Experimental results have revealed that the proposed algorithm can find good solutions regarding root-mean-square deviation and global distance total score test in comparison with the experimental protein structures. We also show that our results are comparable in terms of folding organization with state-of-the-art prediction methods, corroborating the effectiveness of our proposal.
Memetic算法是基于群体的元启发式算法,本质上关注于利用有关所研究问题的所有可用知识。纳入问题领域知识不是一种可选机制,而是Memetic算法的一个基本特征。在本文中,我们提出一种Memetic算法来解决三维蛋白质结构预测问题。该方法使用结构化群体,并纳入模拟退火算法作为局部搜索策略,以及专门的交叉和变异算子来处理该问题。它通过使用角度概率列表利用存储在蛋白质数据库中的结构知识,这有助于减少搜索空间并指导搜索策略。所提出的算法在19个氨基酸残基的蛋白质序列上进行了测试,结果表明该算法能够找到类似天然的蛋白质结构。实验结果表明,与实验蛋白质结构相比,所提出的算法在均方根偏差和全局距离总分测试方面能够找到良好的解决方案。我们还表明,在折叠组织方面,我们的结果与最先进的预测方法相当,证实了我们提议的有效性。