Xiang Zhexin, Steinbach Peter J, Jacobson Matthew P, Friesner Richard A, Honig Barry
Center for Molecular Modeling, Center for Information Technology, National Institutes of Health, Bethesda, Maryland 20892-5624, USA.
Proteins. 2007 Mar 1;66(4):814-23. doi: 10.1002/prot.21099.
An approach is described that improves the prediction of the conformations of surface side chains in crystal structures, given the main-chain conformation of a protein. A key element of the methodology involves the use of the colony energy. This phenomenological term favors conformations found in frequently sampled regions, thereby approximating entropic effects and serving to smooth the potential energy surface. Use of the colony energy significantly improves prediction accuracy for surface side chains with little additional computational cost. Prediction accuracy was quantified as the percentage of side-chain dihedral angles predicted to be within 40 degrees of the angles measured by X-ray diffraction. Use of the colony energy in predictions for single side chains improved the prediction accuracy for chi(1) and chi(1+2) from 65 and 40% to 74 and 59%, respectively. Several other factors that affect prediction of surface side-chain conformations were also analyzed, including the extent of conformational sampling, details of the rotamer library employed, and accounting for the crystallographic environment. The prediction of conformations for polar residues on the surface was generally found to be more difficult than those for hydrophobic residues, except for polar residues participating in hydrogen bonds with other protein groups. For surface residues with hydrogen-bonded side chains, the prediction accuracy of chi(1) and chi(1+2) was 79 and 63%, respectively. For surface polar residues, in general (all side-chain prediction), the accuracy of chi(1) and chi(1+2) was only 73 and 56%, respectively. The most accurate results were obtained using the colony energy and an all-atom description that includes neighboring molecules in the crystal (protein chains and hetero atoms). Here, the accuracy of chi(1) and chi(1+2) predictions for surface side chains was 82 and 73%, respectively. The root mean square deviations obtained for hydrogen-bonding surface side chains were 1.64 and 1.81 A, with and without consideration of crystal packing effects, respectively.
本文描述了一种方法,在已知蛋白质主链构象的情况下,该方法可改进对晶体结构中表面侧链构象的预测。该方法的一个关键要素是使用群体能量。这个唯象学术语有利于在频繁采样区域中发现的构象,从而近似熵效应并有助于平滑势能面。使用群体能量以很少的额外计算成本显著提高了表面侧链的预测准确性。预测准确性被量化为预测的侧链二面角在通过X射线衍射测量的角度的40度范围内的百分比。在单一侧链预测中使用群体能量将χ(1)和χ(1+2)的预测准确性分别从65%和40%提高到74%和59%。还分析了其他几个影响表面侧链构象预测的因素,包括构象采样的程度、所采用的旋转异构体库的细节以及考虑晶体学环境。一般发现,预测表面极性残基的构象比预测疏水残基的构象更困难,但与其他蛋白质基团形成氢键的极性残基除外。对于具有氢键侧链的表面残基,χ(1)和χ(1+2)的预测准确性分别为79%和63%。对于表面极性残基,总体而言(所有侧链预测),χ(1)和χ(1+2)的准确性分别仅为73%和56%。使用群体能量和包括晶体中相邻分子(蛋白质链和杂原子)的全原子描述可获得最准确的结果。在此,表面侧链的χ(1)和χ(1+2)预测准确性分别为82%和73%。对于氢键表面侧链,考虑和不考虑晶体堆积效应时获得的均方根偏差分别为1.64 Å和1.81 Å。