Brunette T J, Brock Oliver
Robotics and Biology Laboratory, Department of Computer Science, University of Massachusetts Amherst, Amherst, Massachusetts 01003-9264, USA.
Proteins. 2008 Dec;73(4):958-72. doi: 10.1002/prot.22123.
The most significant impediment for protein structure prediction is the inadequacy of conformation space search. Conformation space is too large and the energy landscape too rugged for existing search methods to consistently find near-optimal minima. To alleviate this problem, we present model-based search, a novel conformation space search method. Model-based search uses highly accurate information obtained during search to build an approximate, partial model of the energy landscape. Model-based search aggregates information in the model as it progresses, and in turn uses this information to guide exploration toward regions most likely to contain a near-optimal minimum. We validate our method by predicting the structure of 32 proteins, ranging in length from 49 to 213 amino acids. Our results demonstrate that model-based search is more effective at finding low-energy conformations in high-dimensional conformation spaces than existing search methods. The reduction in energy translates into structure predictions of increased accuracy.
蛋白质结构预测面临的最重大障碍是构象空间搜索的不足。构象空间过于庞大,能量景观过于崎岖,以至于现有的搜索方法无法始终找到接近最优的最小值。为了缓解这一问题,我们提出了基于模型的搜索,这是一种新颖的构象空间搜索方法。基于模型的搜索利用搜索过程中获得的高精度信息来构建能量景观的近似部分模型。基于模型的搜索在推进过程中在模型中聚合信息,进而利用这些信息引导探索朝着最有可能包含接近最优最小值的区域进行。我们通过预测32种蛋白质的结构来验证我们的方法,这些蛋白质的长度从49个氨基酸到213个氨基酸不等。我们的结果表明,在高维构象空间中寻找低能量构象时,基于模型的搜索比现有搜索方法更有效。能量的降低转化为准确性更高的结构预测。