Genome Center, University of California-Davis, 451 Health Sciences Drive, Davis, CA 95616, USA.
Proteins. 2011;79 Suppl 10(Suppl 10):91-106. doi: 10.1002/prot.23180. Epub 2011 Oct 14.
CASP has been assessing the state of the art in the a priori estimation of accuracy of protein structure prediction since 2006. The inclusion of model quality assessment category in CASP contributed to a rapid development of methods in this area. In the last experiment, 46 quality assessment groups tested their approaches to estimate the accuracy of protein models as a whole and/or on a per-residue basis. We assessed the performance of these methods predominantly on the basis of the correlation between the predicted and observed quality of the models on both global and local scales. The ability of the methods to identify the models closest to the best one, to differentiate between good and bad models, and to identify well modeled regions was also analyzed. Our evaluations demonstrate that even though global quality assessment methods seem to approach perfection point (weighted average per-target Pearson's correlation coefficients are as high as 0.97 for the best groups), there is still room for improvement. First, all top-performing methods use consensus approaches to generate quality estimates, and this strategy has its own limitations. Second, the methods that are based on the analysis of individual models lag far behind clustering techniques and need a boost in performance. The methods for estimating per-residue accuracy of models are less accurate than global quality assessment methods, with an average weighted per-model correlation coefficient in the range of 0.63-0.72 for the best 10 groups.
自 2006 年以来,CASP 一直在评估先验估计蛋白质结构预测准确性的最新技术。CASP 中模型质量评估类别的纳入促进了该领域方法的快速发展。在最后一次实验中,46 个质量评估组测试了他们的方法来整体评估和/或逐残基评估蛋白质模型的准确性。我们主要基于模型在全局和局部尺度上的预测质量与观察质量之间的相关性来评估这些方法的性能。还分析了这些方法识别最接近最佳模型、区分良好模型和不良模型以及识别建模良好区域的能力。我们的评估表明,尽管全局质量评估方法似乎接近完美点(最佳组的加权平均每个目标皮尔逊相关系数高达 0.97),但仍有改进的空间。首先,所有表现最好的方法都使用共识方法来生成质量估计,而这种策略有其自身的局限性。其次,基于分析单个模型的方法远远落后于聚类技术,需要提高性能。预测模型残基准确性的方法不如全局质量评估方法准确,最佳的 10 个组的平均加权每个模型相关系数在 0.63-0.72 范围内。