Khatib Firas, Weirauch Matthew T, Rohl Carol A
Department of Biomolecular Engineering, University of California at Santa Cruz, Santa Cruz, CA 95064, USA.
Bioinformatics. 2006 Jul 15;22(14):e252-9. doi: 10.1093/bioinformatics/btl236.
Knots in polypeptide chains have been found in very few proteins, and consequently should be generally avoided in protein structure prediction methods. Most effective structure prediction methods do not model the protein folding process itself, but rather seek only to correctly obtain the final native state. Consequently, the mechanisms that prevent knots from occurring in native proteins are not relevant to the modeling process, and as a result, knots can occur with significantly higher frequency in protein models. Here we describe Knotfind, a simple algorithm for knot detection that is fast enough for structure prediction, where tens or hundreds of thousands of conformations may be sampled during the course of a prediction. We have used this algorithm to characterize knots in large populations of model structures generated for targets in CASP 5 and CASP 6 using the Rosetta homology-based modeling method.
Analysis of CASP5 models suggested several possible avenues for introduction of knots into these models, and these insights were applied to structure prediction in CASP 6, resulting in a significant decrease in the proportion of knotted models generated. Additionally, using the knot detection algorithm on structures in the Protein Data Bank, a previously unreported deep trefoil knot was found in acetylornithine transcarbamylase.
The Knotfind algorithm is available in the Rosetta structure prediction program at http://www.rosettacommons.org.
在极少的蛋白质中发现了多肽链中的纽结,因此在蛋白质结构预测方法中通常应避免出现纽结。大多数有效的结构预测方法并不对蛋白质折叠过程本身进行建模,而是仅试图正确获得最终的天然状态。因此,防止纽结在天然蛋白质中出现的机制与建模过程无关,结果,纽结在蛋白质模型中出现的频率可能会显著更高。在此,我们描述了Knotfind,一种用于纽结检测的简单算法,其速度足够快,可用于结构预测,在预测过程中可能会对成千上万种构象进行采样。我们已使用该算法对使用基于同源性的Rosetta建模方法为CASP 5和CASP 6中的目标生成的大量模型结构中的纽结进行了表征。
对CASP5模型的分析提出了几种可能将纽结引入这些模型的途径,并且这些见解被应用于CASP 6中的结构预测,从而使生成的纽结模型的比例显著降低。此外,在蛋白质数据库中的结构上使用纽结检测算法,在乙酰鸟氨酸转氨甲酰酶中发现了一种先前未报道的深三叶纽结。
Knotfind算法可在Rosetta结构预测程序中获取,网址为http://www.rosettacommons.org。