Canutescu Adrian A, Shelenkov Andrew A, Dunbrack Roland L
Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, Pennsylvania 19111, USA.
Protein Sci. 2003 Sep;12(9):2001-14. doi: 10.1110/ps.03154503.
Fast and accurate side-chain conformation prediction is important for homology modeling, ab initio protein structure prediction, and protein design applications. Many methods have been presented, although only a few computer programs are publicly available. The SCWRL program is one such method and is widely used because of its speed, accuracy, and ease of use. A new algorithm for SCWRL is presented that uses results from graph theory to solve the combinatorial problem encountered in the side-chain prediction problem. In this method, side chains are represented as vertices in an undirected graph. Any two residues that have rotamers with nonzero interaction energies are considered to have an edge in the graph. The resulting graph can be partitioned into connected subgraphs with no edges between them. These subgraphs can in turn be broken into biconnected components, which are graphs that cannot be disconnected by removal of a single vertex. The combinatorial problem is reduced to finding the minimum energy of these small biconnected components and combining the results to identify the global minimum energy conformation. This algorithm is able to complete predictions on a set of 180 proteins with 34342 side chains in <7 min of computer time. The total chi(1) and chi(1 + 2) dihedral angle accuracies are 82.6% and 73.7% using a simple energy function based on the backbone-dependent rotamer library and a linear repulsive steric energy. The new algorithm will allow for use of SCWRL in more demanding applications such as sequence design and ab initio structure prediction, as well addition of a more complex energy function and conformational flexibility, leading to increased accuracy.
快速准确的侧链构象预测对于同源建模、从头算蛋白质结构预测和蛋白质设计应用至关重要。虽然已经提出了许多方法,但只有少数计算机程序可供公开使用。SCWRL程序就是这样一种方法,因其速度、准确性和易用性而被广泛使用。本文提出了一种新的SCWRL算法,该算法利用图论结果来解决侧链预测问题中遇到的组合问题。在这种方法中,侧链被表示为无向图中的顶点。任何两个具有非零相互作用能的旋转异构体的残基被认为在图中有一条边。得到的图可以被划分为相互之间没有边的连通子图。这些子图又可以被分解为双连通分量,双连通分量是指不能通过移除单个顶点而断开连接的图。组合问题被简化为找到这些小双连通分量的最小能量,并将结果组合起来以确定全局最小能量构象。该算法能够在不到7分钟的计算机时间内完成对一组包含34342个侧链的180种蛋白质的预测。使用基于主链依赖旋转异构体库和线性排斥空间能的简单能量函数,总的χ(1)和χ(1 + 2)二面角准确率分别为82.6%和73.7%。新算法将允许在更苛刻的应用中使用SCWRL,如序列设计和从头算结构预测,以及添加更复杂的能量函数和构象灵活性,从而提高准确性。