Rabix: an open-source workflow executor supporting recomputability and interoperability of workflow descriptions.

Author information

Kaushik Gaurav, Ivkovic Sinisa, Simonovic Janko, Tijanic Nebojsa, Davis-Dusenbery Brandi, Kural Deniz

Affiliations

Seven Bridges Genomics, 1 Main Street, Cambridge, MA 02140, USA. *Corresponding author.

Publication information

Pac Symp Biocomput. 2017;22:154-165. doi: 10.1142/9789813207813_0016.

Abstract

As biomedical data has become increasingly easy to generate in large quantities, the methods used to analyze it have proliferated rapidly. Reproducible and reusable methods are required to learn from large volumes of data reliably. To address this issue, numerous groups have developed workflow specifications or execution engines, which provide a framework with which to perform a sequence of analyses. One such specification is the Common Workflow Language, an emerging standard which provides a robust and flexible framework for describing data analysis tools and workflows. In addition, reproducibility can be furthered by executors or workflow engines which interpret the specification and enable additional features, such as error logging, file organization, optimizations to computation and job scheduling, and allow for easy computing on large volumes of data. To this end, we have developed the Rabix Executor, an open-source workflow engine for the purposes of improving reproducibility through reusability and interoperability of workflow descriptions.
