Mönnigmann M, Floudas C A
Department of Chemical Engineering, Princeton University, Princeton, New Jersey 08544-5263, USA.
Proteins. 2005 Dec 1;61(4):748-62. doi: 10.1002/prot.20669.
The structure prediction of loops with flexible stem residues is addressed in this article. While the secondary structure of the stem residues is assumed to be known, the geometry of the protein into which the loop must fit is considered to be unknown in our methodology. As a consequence, the compatibility of the loop with the remainder of the protein is not used as a criterion to reject loop decoys. The loop structure prediction with flexible stems is more difficult than fitting loops into a known protein structure in that a larger conformational space has to be covered. The main focus of the study is to assess the precision of loop structure prediction if no information on the protein geometry is available. The proposed approach is based on (1) dihedral angle sampling, (2) structure optimization by energy minimization with a physically based energy function, (3) clustering, and (4) a comparison of strategies for the selection of loops identified in (3). Steps (1) and (2) have similarities to previous approaches to loop structure prediction with fixed stems. Step (3) is based on a new iterative approach to clustering that is tailored for the loop structure prediction problem with flexible stems. In this new approach, clustering is not only used to identify conformers that are likely to be close to the native structure, but clustering is also employed to identify far-from-native decoys. By discarding these decoys iteratively, the overall quality of the ensemble and the loop structure prediction is improved. Step (4) provides a comparative study of criteria for loop selection based on energy, colony energy, cluster density, and a hybrid criterion introduced here. The proposed method is tested on a large set of 3215 loops from proteins in the Pdb-Select25 set and to 179 loops from proteins from the Casp6 experiment.
本文探讨了具有柔性茎残基的环的结构预测问题。虽然茎残基的二级结构被假定为已知,但在我们的方法中,环必须适配的蛋白质的几何形状被认为是未知的。因此,环与蛋白质其余部分的兼容性未被用作拒绝环诱饵的标准。具有柔性茎的环结构预测比将环拟合到已知蛋白质结构中更困难,因为必须覆盖更大的构象空间。该研究的主要重点是评估在没有蛋白质几何形状信息的情况下环结构预测的精度。所提出的方法基于:(1)二面角采样;(2)使用基于物理的能量函数通过能量最小化进行结构优化;(3)聚类;以及(4)对在(3)中确定的环的选择策略进行比较。步骤(1)和(2)与先前用于固定茎的环结构预测方法有相似之处。步骤(3)基于一种新的迭代聚类方法,该方法是为具有柔性茎的环结构预测问题量身定制的。在这种新方法中,聚类不仅用于识别可能接近天然结构的构象异构体,还用于识别远离天然的诱饵。通过迭代丢弃这些诱饵,提高了整体集合的质量和环结构预测的质量。步骤(4)提供了基于能量、群体能量、簇密度以及此处引入的混合标准的环选择标准的比较研究。所提出的方法在来自Pdb-Select25集中蛋白质的3215个环的大型数据集以及来自Casp6实验中蛋白质的179个环上进行了测试。