King's College London, Department of Computer Science, London WC2R 2LS, UK.
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S39. doi: 10.1186/1471-2105-11-S1-S39.
The protein folding problem remains one of the most challenging open problems in computational biology. Simplified models in terms of lattice structure and energy function have been proposed to ease the computational hardness of this optimization problem. Heuristic search algorithms and constraint programming are two common techniques to approach this problem. The present study introduces a novel hybrid approach to simulate the protein folding problem using constraint programming technique integrated within local search.
Using the face-centered-cubic lattice model and 20 amino acid pairwise interactions energy function for the protein folding problem, a constraint programming technique has been applied to generate the neighbourhood conformations that are to be used in generic local search procedure. Experiments have been conducted for a few small and medium sized proteins. Results have been compared with both pure constraint programming approach and local search using well-established local move set. Substantial improvements have been observed in terms of final energy values within acceptable runtime using the hybrid approach.
Constraint programming approaches usually provide optimal results but become slow as the problem size grows. Local search approaches are usually faster but do not guarantee optimal solutions and tend to stuck in local minima. The encouraging results obtained on the small proteins show that these two approaches can be combined efficiently to obtain better quality solutions within acceptable time. It also encourages future researchers on adopting hybrid techniques to solve other hard optimization problems.
蛋白质折叠问题仍然是计算生物学中最具挑战性的开放性问题之一。为了缓解这个优化问题的计算难度,已经提出了简化的格子结构和能量函数模型。启发式搜索算法和约束编程是两种常见的解决此问题的技术。本研究介绍了一种使用约束编程技术与局部搜索相结合的新的混合方法来模拟蛋白质折叠问题。
使用面心立方格子模型和 20 种氨基酸对相互作用能量函数作为蛋白质折叠问题的能量函数,应用约束编程技术生成局部搜索过程中要使用的邻域构象。对几个中小蛋白质进行了实验。将混合方法与纯约束编程方法和使用成熟局部移动集的局部搜索进行了比较。在可接受的运行时间内,混合方法在最终能量值方面取得了显著的改进。
约束编程方法通常提供最优结果,但随着问题规模的增长变得缓慢。局部搜索方法通常更快,但不能保证最优解,并且容易陷入局部最小值。在小蛋白质上获得的令人鼓舞的结果表明,这两种方法可以有效地结合起来,在可接受的时间内获得更好的质量解。它也鼓励未来的研究人员采用混合技术来解决其他困难的优化问题。