Howard Hughes Medical Institute, University of Texas, Southwestern Medical Center, Dallas, TX 75390-9050, USA.
Proteins. 2011;79 Suppl 10(Suppl 10):21-36. doi: 10.1002/prot.23190. Epub 2011 Oct 14.
The Critical Assessment of protein Structure Prediction round 9 (CASP9) aimed to evaluate predictions for 129 experimentally determined protein structures. To assess tertiary structure predictions, these target structures were divided into domain-based evaluation units that were then classified into two assessment categories: template-based modeling (TBM) and template-free modeling (FM). CASP9 targets were split into domains, defined as structurally compact evolutionary modules. For targets with more than one defined domain, the decision to split a structure into domains for evaluation was based on server performance. Target domains were categorized by their evolutionary relatedness to existing templates and by their difficulty level as indicated by server performance. Target domains with sequence-related templates and high server prediction performance were classified as TBM, whereas those without identifiable templates and with low server performance were classified as FM. However, applying these generalizations left a blurred boundary between the CASP9 assessment categories. Thus, the FM category included domains without sequence-detectable templates (25 target domains) as well as some domains with difficult-to-detect templates whose predictions were as poor as those for domains without templates (five target domains). Several interesting examples are discussed, including targets with sequence-related templates that exhibit unusual structural differences, targets with homologous or analogous structure templates that are not detectable by sequence, and targets with new folds.