Stolovitzky Gustavo, Prill Robert J, Califano Andrea
IBM Computational Biology Center, Yorktown Heights, New York, USA.
Ann N Y Acad Sci. 2009 Mar;1158:159-95. doi: 10.1111/j.1749-6632.2009.04497.x.
Regardless of how creative, innovative, and elegant our computational methods, the ultimate proof of an algorithm's worth is the experimentally validated quality of its predictions. Unfortunately, this truism is hard to reduce to practice. Usually, modelers produce hundreds to hundreds of thousands of predictions, most (if not all) of which go untested. In a best-case scenario, a small subsample of predictions (three to ten usually) is experimentally validated, as a quality control step to attest to the global soundness of the full set of predictions. However, whether this small set is even representative of the global algorithm's performance is a question usually left unaddressed. Thus, a clear understanding of the strengths and weaknesses of an algorithm most often remains elusive, especially to the experimental biologists who must decide which tool to use to address a specific problem. In this chapter, we describe the first systematic set of challenges posed to the systems biology community in the framework of the DREAM (Dialogue for Reverse Engineering Assessments and Methods) project. These tests, which came to be known as the DREAM2 challenges, consist of data generously donated by participants to the DREAM project and curated in such a way as to become problems of network reconstruction and whose solutions, the actual networks behind the data, were withheld from the participants. The explanation of the resulting five challenges, a global comparison of the submissions, and a discussion of the best performing strategies are the main topics discussed.
无论我们的计算方法多么富有创造性、创新性和优雅性,算法价值的最终证明是其预测结果经过实验验证的质量。不幸的是,这条真理很难付诸实践。通常,建模者会生成成百上千甚至成千上万的预测结果,其中大多数(如果不是全部的话)都未经测试。在最佳情况下,会对一小部分预测结果(通常为三到十个)进行实验验证,作为质量控制步骤,以证明整套预测结果在整体上的可靠性。然而,这一小部分结果是否能代表全局算法的性能,这个问题通常无人问津。因此,对于算法优缺点的清晰理解往往仍然难以实现,尤其是对于那些必须决定使用哪种工具来解决特定问题的实验生物学家来说。在本章中,我们描述了在DREAM(逆向工程评估与方法对话)项目框架下,系统生物学界面临的第一组系统性挑战。这些测试后来被称为DREAM2挑战,由参与者慷慨捐赠给DREAM项目的数据组成,并经过整理,使其成为网络重建问题,而数据背后的实际网络(即解决方案)对参与者保密。本章主要讨论由此产生的五个挑战、提交结果的全局比较以及对最佳执行策略的讨论。