Colubri Andrés
Searle Chemistry Lab, University of Chicago, 5735 South Ellis Ave #126, Chicago, Illinois 60637, USA.
J Biomol Struct Dyn. 2004 Apr;21(5):625-38. doi: 10.1080/07391102.2004.10506953.
A set of software tools designed to study protein structure and kinetics has been developed. The core of these tools is a program called Folding Machine (FM) which is able to generate low resolution folding pathways using modest computational resources. The FM is based on a coarse-grained kinetic ab initio Monte-Carlo sampler that can optionally use information extracted from secondary structure prediction servers or from fragment libraries of local structure. The model underpinning this algorithm contains two novel elements: (a) the conformational space is discretized using the Ramachandran basins defined in the local phi-psi energy maps; and (b) the solvent is treated implicitly by rescaling the pairwise terms of the non-bonded energy function according to the local solvent environments. The purpose of this hybrid ab initio/knowledge-based approach is threefold: to cover the long time scales of folding, to generate useful 3-dimensional models of protein structures, and to gain insight on the protein folding kinetics. Even though the algorithm is not yet fully developed, it has been used in a recent blind test of protein structure prediction (CASP5). The FM generated models within 6 A backbone rmsd for fragments of about 60-70 residues of alpha-helical proteins. For a CASP5 target that turned out to be natively unfolded, the trajectory obtained for this sequence uniquely failed to converge. Also, a new measure to evaluate structure predictions is presented and used along the standard CASP assessment methods. Finally, recent improvements in the prediction of beta-sheet structures are briefly described.
已经开发出一套用于研究蛋白质结构和动力学的软件工具。这些工具的核心是一个名为折叠机(FM)的程序,它能够使用适度的计算资源生成低分辨率的折叠路径。FM基于一种粗粒度的从头算动力学蒙特卡罗采样器,该采样器可以选择使用从二级结构预测服务器或局部结构片段库中提取的信息。支撑该算法的模型包含两个新元素:(a)使用局部phi-psi能量图中定义的拉马钱德兰盆地对构象空间进行离散化;(b)根据局部溶剂环境重新缩放非键合能量函数的成对项来隐式处理溶剂。这种混合从头算/基于知识的方法有三个目的:涵盖折叠的长时间尺度,生成有用的蛋白质结构三维模型,并深入了解蛋白质折叠动力学。尽管该算法尚未完全开发,但它已被用于最近的蛋白质结构预测盲测(CASP5)。对于α-螺旋蛋白约60-70个残基的片段,FM生成的模型在主链均方根偏差6埃以内。对于一个在天然状态下未折叠的CASP5目标,该序列获得的轨迹唯一未能收敛。此外,还提出了一种评估结构预测的新方法,并与标准的CASP评估方法一起使用。最后,简要描述了β-折叠结构预测方面的最新改进。