Simons K T, Kooperberg C, Huang E, Baker D
Department of Biochemistry, University of Washington, Seattle 98195, USA.
J Mol Biol. 1997 Apr 25;268(1):209-25. doi: 10.1006/jmbi.1997.0959.
We explore the ability of a simple simulated annealing procedure to assemble native-like structures from fragments of unrelated protein structures with similar local sequences using Bayesian scoring functions. Environment and residue pair specific contributions to the scoring functions appear as the first two terms in a series expansion for the residue probability distributions in the protein database; the decoupling of the distance and environment dependencies of the distributions resolves the major problems with current database-derived scoring functions noted by Thomas and Dill. The simulated annealing procedure rapidly and frequently generates native-like structures for small helical proteins and better than random structures for small beta sheet containing proteins. Most of the simulated structures have native-like solvent accessibility and secondary structure patterns, and thus ensembles of these structures provide a particularly challenging set of decoys for evaluating scoring functions. We investigate the effects of multiple sequence information and different types of conformational constraints on the overall performance of the method, and the ability of a variety of recently developed scoring functions to recognize the native-like conformations in the ensembles of simulated structures.
我们利用贝叶斯评分函数,探索一种简单的模拟退火程序从具有相似局部序列的不相关蛋白质结构片段组装类似天然结构的能力。环境和残基对评分函数的特定贡献在蛋白质数据库中残基概率分布的级数展开中表现为前两项;分布的距离和环境依赖性的解耦解决了Thomas和Dill指出的当前基于数据库的评分函数的主要问题。模拟退火程序能快速且频繁地为小螺旋蛋白生成类似天然的结构,对于含小β折叠的蛋白,生成的结构优于随机结构。大多数模拟结构具有类似天然的溶剂可及性和二级结构模式,因此这些结构的集合为评估评分函数提供了一组特别具有挑战性的诱饵。我们研究了多序列信息和不同类型的构象约束对该方法整体性能的影响,以及各种最近开发的评分函数识别模拟结构集合中类似天然构象的能力。