Shi Shuoyong, Pei Jimin, Sadreyev Ruslan I, Kinch Lisa N, Majumdar Indraneel, Tong Jing, Cheng Hua, Kim Bong-Hyun, Grishin Nick V
Howard Hughes Medical Institute and Department of Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA.
Database (Oxford). 2009;2009:bap003. doi: 10.1093/database/bap003. Epub 2009 Apr 14.
Results of the recent Critical Assessment of Techniques for Protein Structure Prediction, CASP8, present several valuable sources of information. First, CASP targets comprise a realistic sample of currently solved protein structures and exemplify the corresponding challenges for predictors. Second, the plethora of predictions by all possible methods provides unusually rich material for evolutionary analysis of target proteins. Third, CASP results show the current state of the field and highlight specific problems in both prediction and assessment. Finally, these data can serve as a basis for developing and analyzing methods for assessing prediction quality. Here we present results of our analysis in these areas. Our objective is not to duplicate CASP assessment, but to use our unique experience as former CASP5 assessors and CASP8 predictors to (i) offer more insights into CASP targets and predictions based on expert analysis, including invaluable analysis prior to target structure release; and (ii) develop an assessment methodology tailored towards current challenges in the field. Specifically, we discuss preparing target structures for assessment, parsing protein domains, balancing evaluations based on domains and on whole chains, dividing targets into categories and developing new evaluation scores. We also present evolutionary analysis of the most interesting and challenging targets.
Database URL: Our results are available as a comprehensive database of targets and predictions at http://prodata.swmed.edu/CASP8.