Cold Spring Harbor Laboratories, Cold Spring Harbor, Ithaca, NY, USA.
USDA ARS NEA Robert W. Holley Center for Agriculture and Health, Cornell University, Ithaca, New York, USA.
Bioinformatics. 2018 Nov 15;34(22):3917-3920. doi: 10.1093/bioinformatics/bty439.
The rapid accumulation of both sequence and phenotype data generated by high-throughput methods has increased the need to store and analyze data on distributed storage and computing systems. Efficient data management across these heterogeneous systems requires a workflow management system to simplify the task of analysis through automation and make large-scale bioinformatics analyses accessible and reproducible.
We developed SciApps, a web-based platform for reproducible bioinformatics workflows. The platform is designed to automate the execution of modular Agave apps and support execution of workflows on local clusters or in a cloud. Two workflows, one for association and one for annotation, are provided as exemplar scientific use cases.
Supplementary data are available at Bioinformatics online.
高通量方法产生的序列和表型数据的快速积累增加了对分布式存储和计算系统上的数据存储和分析的需求。要在这些异构系统中实现高效的数据管理,需要一个工作流管理系统,通过自动化来简化分析任务,并使大规模生物信息学分析具有可访问性和可重复性。
我们开发了 SciApps,这是一个基于网络的可重复生物信息学工作流程平台。该平台旨在自动执行模块化的 Agave 应用程序,并支持在本地集群或云中执行工作流程。提供了两个工作流程,一个用于关联,一个用于注释,作为示范科学用例。
补充数据可在《Bioinformatics》在线获得。