Kahanda Indika, Funk Christopher S, Ullah Fahad, Verspoor Karin M, Ben-Hur Asa
Department of Computer Science, Colorado State University, Fort Collins, 80523 CO USA.
Computational Bioscience Program, University of Colorado School of Medicine, Aurora, 80045 CO USA.
Gigascience. 2015 Sep 14;4:41. doi: 10.1186/s13742-015-0082-5. eCollection 2015.
The recently held Critical Assessment of Function Annotation challenge (CAFA2) required its participants to submit predictions for a large number of target proteins regardless of whether they have previous annotations or not. This is in contrast to the original CAFA challenge in which participants were asked to submit predictions for proteins with no existing annotations. The CAFA2 task is more realistic, in that it more closely mimics the accumulation of annotations over time. In this study we compare these tasks in terms of their difficulty, and determine whether cross-validation provides a good estimate of performance.
The CAFA2 task is a combination of two subtasks: making predictions on annotated proteins and making predictions on previously unannotated proteins. In this study we analyze the performance of several function prediction methods in these two scenarios. Our results show that several methods (structured support vector machine, binary support vector machines and guilt-by-association methods) do not usually achieve the same level of accuracy on these two tasks as that achieved by cross-validation, and that predicting novel annotations for previously annotated proteins is a harder problem than predicting annotations for uncharacterized proteins. We also find that different methods have different performance characteristics in these tasks, and that cross-validation is not adequate at estimating performance and ranking methods.
These results have implications for the design of computational experiments in the area of automated function prediction and can provide useful insight for the understanding and design of future CAFA competitions.
最近举办的功能注释关键评估挑战赛(CAFA2)要求参与者对大量目标蛋白质提交预测结果,无论这些蛋白质之前是否已有注释。这与最初的CAFA挑战赛不同,在最初的挑战赛中,要求参与者对没有现有注释的蛋白质提交预测结果。CAFA2任务更贴近现实,因为它更紧密地模拟了注释随时间的积累。在本研究中,我们从难度方面比较了这些任务,并确定交叉验证是否能很好地估计性能。
CAFA2任务是两个子任务的组合:对有注释的蛋白质进行预测以及对之前未注释的蛋白质进行预测。在本研究中,我们分析了几种功能预测方法在这两种情况下的性能。我们的结果表明,几种方法(结构化支持向量机、二元支持向量机和关联有罪方法)在这两个任务上通常无法达到与交叉验证相同的准确率水平,并且对之前已有注释的蛋白质预测新注释比预测未表征蛋白质的注释更难。我们还发现不同方法在这些任务中有不同的性能特征,并且交叉验证在估计性能和对方法进行排名方面并不充分。
这些结果对自动功能预测领域的计算实验设计有影响,并且能为理解和设计未来的CAFA竞赛提供有用的见解。