School of Computer & Software, Nanjing University of Information Science & Technology, Nanjing 210044, China.
Comput Biol Chem. 2012 Jun;38:17-26. doi: 10.1016/j.compbiolchem.2012.02.001. Epub 2012 Mar 5.
The protein folding problem, i.e., the prediction of the tertiary structures of protein molecules from their amino acid sequences is one of the most important problems in computational biology and biochemistry. However, the extremely difficult optimization problem arising from energy function is a key challenge in protein folding simulation. The energy landscape paving (ELP) method has already been applied very successfully to off-lattice protein models and other optimization problems with complex energy landscape in continuous space. By improving the ELP method, and subsequently incorporating the neighborhood strategy with the pull-move set into the improved ELP method, a heuristic ELP algorithm is proposed to find low-energy conformations of 3D HP lattice model proteins in the discrete space. The algorithm is tested on three sets of 3D HP benchmark instances consisting 31 sequences. For eleven sequences with 27 monomers, the proposed method explores the conformation surfaces more efficiently than other methods, and finds new lower energies in several cases. For ten 48-monomer sequences, we find the lowest energies so far. With the achieved results, the algorithm converges rapidly and efficiently. For all ten 64-monomer sequences, the algorithm finds lower energies within comparable computation times than previous methods. Numeric results show that the heuristic ELP method is a competitive tool for protein folding simulation in 3D lattice model. To the best of our knowledge, this is the first application of ELP to the 3D discrete space.
蛋白质折叠问题,即从氨基酸序列预测蛋白质分子的三级结构,是计算生物学和生物化学中最重要的问题之一。然而,能量函数带来的极其困难的优化问题是蛋白质折叠模拟中的一个关键挑战。能量景观铺平(ELP)方法已经非常成功地应用于无网格蛋白质模型和其他连续空间中具有复杂能量景观的优化问题。通过改进 ELP 方法,并随后将邻域策略与拉动集纳入改进的 ELP 方法中,提出了一种启发式 ELP 算法,用于在离散空间中寻找 3D HP 晶格模型蛋白质的低能构象。该算法在三个由 31 个序列组成的 3D HP 基准实例集上进行了测试。对于具有 27 个单体的 11 个序列,该方法比其他方法更有效地探索构象表面,并在几种情况下找到了新的更低能量。对于十个 48-单体序列,我们找到了迄今为止的最低能量。通过所取得的结果,该算法快速有效地收敛。对于所有十个 64-单体序列,该算法在可比的计算时间内找到了更低的能量,优于先前的方法。数值结果表明,启发式 ELP 方法是 3D 晶格模型中蛋白质折叠模拟的一种有竞争力的工具。据我们所知,这是 ELP 首次应用于 3D 离散空间。