Kolinski A, Skolnick J
Department of Molecular Biology, The Scripps Research Institute, La Jolla, California 92037, USA.
Proteins. 1998 Sep 1;32(4):475-94.
A new, efficient method for the assembly of protein tertiary structure from known, loosely encoded secondary structure restraints and sparse information about exact side chain contacts is proposed and evaluated. The method is based on a new, very simple method for the reduced modeling of protein structure and dynamics, where the protein is described as a lattice chain connecting side chain centers of mass rather than Calphas. The model has implicit built-in multibody correlations that simulate short- and long-range packing preferences, hydrogen bonding cooperativity and a mean force potential describing hydrophobic interactions. Due to the simplicity of the protein representation and definition of the model force field, the Monte Carlo algorithm is at least an order of magnitude faster than previously published Monte Carlo algorithms for structure assembly. In contrast to existing algorithms, the new method requires a smaller number of tertiary restraints for successful fold assembly; on average, one for every seven residues as compared to one for every four residues. For example, for smaller proteins such as the B domain of protein G, the resulting structures have a coordinate root mean square deviation (cRMSD), which is about 3 A from the experimental structure; for myoglobin, structures whose backbone cRMSD is 4.3 A are produced, and for a 247-residue TIM barrel, the cRMSD of the resulting folds is about 6 A. As would be expected, increasing the number of tertiary restraints improves the accuracy of the assembled structures. The reliability and robustness of the new method should enable its routine application in model building protocols based on various (very sparse) experimentally derived structural restraints.
本文提出并评估了一种新的高效方法,该方法可根据已知的、编码松散的二级结构限制以及关于精确侧链接触的稀疏信息来组装蛋白质三级结构。该方法基于一种全新且非常简单的蛋白质结构和动力学简化建模方法,在这种方法中,蛋白质被描述为连接侧链质心的晶格链,而非仅连接α碳原子。该模型具有隐含的内置多体相关性,可模拟短程和长程堆积偏好、氢键协同作用以及描述疏水相互作用的平均力势。由于蛋白质表示和模型力场定义的简单性,蒙特卡罗算法比之前发表的用于结构组装的蒙特卡罗算法至少快一个数量级。与现有算法不同,新方法成功折叠组装所需的三级结构限制数量更少;平均而言,每七个残基需要一个限制,而之前的方法是每四个残基需要一个。例如,对于较小的蛋白质,如蛋白G的B结构域,所得结构与实验结构的坐标均方根偏差(cRMSD)约为3 Å;对于肌红蛋白,产生的结构其主链cRMSD为4.3 Å;对于一个247个残基的TIM桶状结构,所得折叠结构的cRMSD约为6 Å。正如预期的那样,增加三级结构限制可提高组装结构的准确性。新方法的可靠性和稳健性应使其能够常规应用于基于各种(非常稀疏的)实验得出的结构限制的模型构建协议中。