Reproducible Bioconductor workflows using browser-based interactive notebooks and containers.

Affiliations

Institute of Technology, University of Washington, Tacoma, WA, USA.

Department of Clinical Investigation, Madigan Army Medical Center, Tacoma, WA, USA.

Publication information

J Am Med Inform Assoc. 2018 Jan 1;25(1):4-12. doi: 10.1093/jamia/ocx120.

Abstract

OBJECTIVE

Bioinformatics publications typically include complex software workflows that are difficult to describe in a manuscript. We describe and demonstrate the use of interactive software notebooks to document and distribute bioinformatics research. We provide a user-friendly tool, BiocImageBuilder, that allows users to easily distribute their bioinformatics protocols through interactive notebooks uploaded to either a GitHub repository or a private server.

MATERIALS AND METHODS

We present four different interactive Jupyter notebooks using R and Bioconductor workflows to infer differential gene expression, analyze cross-platform datasets, process RNA-seq data and KinomeScan data. These interactive notebooks are available on GitHub. The analytical results can be viewed in a browser. Most importantly, the software contents can be executed and modified. This is accomplished using Binder, which runs the notebook inside software containers, thus avoiding the need to install any software and ensuring reproducibility. All the notebooks were produced using custom files generated by BiocImageBuilder.
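As a rough illustration of how a notebook hosted on GitHub can be opened through Binder, the sketch below builds a launch URL for the public mybinder.org service. The owner, repository, and notebook names are hypothetical placeholders, not the authors' repositories, and the `filepath` query parameter is an assumption based on the URL scheme mybinder.org has accepted; other Binder deployments may expect a different parameter.

```python
from urllib.parse import quote, urlencode

# Public Binder service; a private BinderHub would use its own host.
BINDER_HOST = "https://mybinder.org"


def binder_launch_url(owner, repo, ref="HEAD", notebook=None):
    """Build a Binder launch URL for a GitHub repository.

    owner, repo, and ref identify the repository and the branch, tag,
    or commit to build. notebook is an optional path inside the
    repository to open once the container has started.
    """
    parts = [quote(part, safe="") for part in (owner, repo, ref)]
    url = f"{BINDER_HOST}/v2/gh/" + "/".join(parts)
    if notebook:
        # Assumption: 'filepath' selects the notebook to open.
        url += "?" + urlencode({"filepath": notebook})
    return url


if __name__ == "__main__":
    # Hypothetical repository and notebook names, for illustration only.
    print(binder_launch_url("example-user", "example-repo",
                            notebook="notebooks/deg.ipynb"))
```

Pinning `ref` to a tag or commit hash rather than a branch name keeps the link pointing at the exact version of the notebook and its container definition.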

RESULTS

BiocImageBuilder facilitates the publication of workflows with a point-and-click user interface. We demonstrate that interactive notebooks can be used to disseminate a wide range of bioinformatics analyses. The use of software containers to mirror the original software environment ensures reproducibility of results. Parameters and code can be dynamically modified, allowing for robust verification of published results and encouraging rapid adoption of new methods.
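To make the idea of mirroring a software environment in a container more concrete, the following minimal sketch renders a two-line Dockerfile from a base image and a list of Bioconductor packages. It is not the output of BiocImageBuilder, whose generated files are not described here; the function name, the base image tag, and the package list are all assumptions chosen for illustration, and a real Binder image would also need a Jupyter server and an R kernel.

```python
def render_dockerfile(base_image, bioc_packages):
    """Return Dockerfile text that installs Bioconductor packages.

    base_image should be a tagged image so that the R and Bioconductor
    versions are fixed. bioc_packages is an iterable of package names;
    duplicates are dropped and names are sorted so the output is stable.
    """
    packages = sorted(set(bioc_packages))
    if not packages:
        raise ValueError("at least one package is required")
    quoted = ", ".join(f'"{name}"' for name in packages)
    lines = [
        f"FROM {base_image}",
        # update = FALSE avoids silently upgrading packages already in the image.
        f"RUN R -e 'BiocManager::install(c({quoted}), ask = FALSE, update = FALSE)'",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Assumed base image tag and package names, for illustration only.
    print(render_dockerfile("bioconductor/bioconductor_docker:RELEASE_3_18",
                            ["limma", "edgeR"]))
```

Fixing the base image to a release tag is what ties the container to one Bioconductor version; an untagged image would drift as new releases appear.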

CONCLUSION

Given the increasing complexity of bioinformatics workflows, we anticipate that these interactive software notebooks will become as necessary for documenting software methods as traditional laboratory notebooks have been for documenting bench protocols, and as ubiquitous.


