IEEE/ACM Trans Comput Biol Bioinform. 2017 Nov-Dec;14(6):1288-1301. doi: 10.1109/TCBB.2016.2566617. Epub 2016 May 11.
De novo protein structure prediction aims to search for low-energy conformations as it follows the thermodynamics hypothesis that places native conformations at the global minimum of the protein energy surface. However, the native conformation is not necessarily located in the lowest-energy regions owing to the inaccuracies of the energy model. This study presents a differential evolution algorithm using distance profile-based selection strategy to sample conformations with reasonable structure effectively. In the proposed algorithm, besides energy, the residue-residue distance is considered another measure of the conformation. The average distance errors of decoys between the distance of each residue pair and the corresponding distance in the distance profiles are first calculated when the trial conformation yields a larger energy value than that of the target. Then, the distance acceptance probability of the trial conformation is designed based on distance profiles if the trial conformation obtains a lower average distance error compared with that of the target conformation. The trial conformation is accepted to the next generation in accordance with its distance acceptance probability. By using the dual constraints of energy and distance in guiding sampling, the algorithm can sample conformations with lower energies and more reasonable structures. Experimental results of 28 benchmark proteins show that the proposed algorithm can effectively predict near-native protein structures.
从头蛋白质结构预测旨在搜索低能量构象,因为它遵循热力学假说,即天然构象位于蛋白质能量表面的全局最低点。然而,由于能量模型的不准确性,天然构象不一定位于最低能量区域。本研究提出了一种基于距离剖面的选择策略的差分进化算法,以有效地采样具有合理结构的构象。在提出的算法中,除了能量之外,残基-残基距离被认为是构象的另一个度量。当试探构象的能量值大于目标构象的能量值时,首先计算出试探构象的每个残基对之间的距离与距离剖面中相应距离之间的平均距离误差。然后,如果试探构象的平均距离误差低于目标构象的平均距离误差,则基于距离剖面设计试探构象的距离接受概率。如果试探构象的距离接受概率低于目标构象的距离接受概率,则根据其距离接受概率将试探构象接受为下一代。通过在指导采样中使用能量和距离的双重约束,该算法可以采样具有更低能量和更合理结构的构象。28 个基准蛋白质的实验结果表明,该算法可以有效地预测近天然蛋白质结构。