Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland.
SIB Swiss Institute of Bioinformatics, Zurich, Switzerland.
Genome Biol. 2023 May 17;24(1):119. doi: 10.1186/s13059-023-02962-5.
Computational methods represent the lifeblood of modern molecular biology. Benchmarking is important for all methods, but with a focus here on computational methods, benchmarking is critical to dissect important steps of analysis pipelines, formally assess performance across common situations as well as edge cases, and ultimately guide users on what tools to use. Benchmarking can also be important for community building and advancing methods in a principled way. We conducted a meta-analysis of recent single-cell benchmarks to summarize the scope, extensibility, and neutrality, as well as technical features and whether best practices in open data and reproducible research were followed. The results highlight that while benchmarks often make code available and are in principle reproducible, they remain difficult to extend, for example, as new methods and new ways to assess methods emerge. In addition, embracing containerization and workflow systems would enhance reusability of intermediate benchmarking results, thus also driving wider adoption.
计算方法是现代分子生物学的生命线。基准测试对于所有方法都很重要,但这里重点关注计算方法,基准测试对于剖析分析管道的重要步骤、正式评估常见情况和边缘情况的性能以及最终指导用户使用哪些工具都至关重要。基准测试对于社区建设和以有原则的方式推进方法也很重要。我们对最近的单细胞基准测试进行了荟萃分析,以总结范围、可扩展性和中立性,以及技术特征,以及是否遵循开放数据和可重复研究的最佳实践。结果表明,虽然基准测试通常提供代码并且原则上是可重复的,但它们仍然难以扩展,例如,随着新方法和评估方法的新方法的出现。此外,采用容器化和工作流系统将增强中间基准测试结果的可重用性,从而也推动更广泛的采用。