Bower M J, Cohen F E, Dunbrack R L
Department of Pharmaceutical Chemistry, University of California San Francisco, 94143-0450, USA.
J Mol Biol. 1997 Apr 18;267(5):1268-82. doi: 10.1006/jmbi.1997.0926.
Modeling by homology is the most accurate computational method for translating an amino acid sequence into a protein structure. Homology modeling can be divided into two sub-problems, placing the polypeptide backbone and adding side-chains. We present a method for rapidly predicting the conformations of protein side-chains, starting from main-chain coordinates alone. The method involves using fewer than ten rotamers per residue from a backbone-dependent rotamer library and a search to remove steric conflicts. The method is initially tested on 299 high resolution crystal structures by rebuilding side-chains onto the experimentally determined backbone structures. A total of 77% of chi1 and 66% of chi(1 + 2) dihedral angles are predicted within 40 degrees of their crystal structure values. We then tested the method on the entire database of known structures in the Protein Data Bank. The predictive accuracy of the algorithm was strongly correlated with the resolution of the structures. In an effort to simulate a realistic homology modeling problem, 9424 homology models were created using three different modeling strategies. For prediction purposes, pairs of structures were identified which shared between 30% and 90% sequence identity. One strategy results in 82% of chi1 and 72% chi(1 + 2) dihedral angles predicted within 40 degrees of the target crystal structure values, suggesting that movements of the backbone associated with this degree of sequence identity are not large enough to disrupt the predictive ability of our method for non-native backbones. These results compared favorably with existing methods over a comprehensive data set.
通过同源性建模是将氨基酸序列转化为蛋白质结构最准确的计算方法。同源性建模可分为两个子问题,即放置多肽主链和添加侧链。我们提出了一种仅从主链坐标开始快速预测蛋白质侧链构象的方法。该方法涉及使用来自依赖主链的旋转异构体库中每个残基少于十个的旋转异构体,并进行搜索以消除空间冲突。该方法首先通过将侧链重建到实验确定的主链结构上,在299个高分辨率晶体结构上进行测试。总共77%的χ1和66%的χ(1 + 2)二面角预测值与它们的晶体结构值相差在40度以内。然后我们在蛋白质数据库中已知结构的整个数据库上测试该方法。该算法的预测准确性与结构的分辨率密切相关。为了模拟一个现实的同源性建模问题,使用三种不同的建模策略创建了9424个同源性模型。为了进行预测,确定了序列同一性在30%至90%之间的成对结构。一种策略导致82%的χ1和72%的χ(1 + 2)二面角预测值与目标晶体结构值相差在40度以内,这表明与这种程度的序列同一性相关的主链移动不足以破坏我们的方法对非天然主链的预测能力。在一个全面的数据集中,这些结果与现有方法相比具有优势。