• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BioFlow-Insight:通过结构重建和可视化促进Nextflow工作流程的重用。

BioFlow-Insight: facilitating reuse of Nextflow workflows with structure reconstruction and visualization.

作者信息

Marchment George, Brancotte Bryan, Schmit Marie, Lemoine Frédéric, Cohen-Boulakia Sarah

机构信息

Université Paris-Saclay, CNRS, Laboratoire Interdisciplinaire des Sciences du Numérique, 91405, Orsay, France.

Institut Pasteur, Université Paris Cité, Bioinformatics and Biostatistics Hub, Paris, France.

出版信息

NAR Genom Bioinform. 2024 Aug 6;6(3):lqae092. doi: 10.1093/nargab/lqae092. eCollection 2024 Sep.

DOI:10.1093/nargab/lqae092
PMID:39108637
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11302447/
Abstract

Bioinformatics workflows are increasingly used for sharing analyses, serving as a cornerstone for enhancing the reproducibility and shareability of bioinformatics analyses. In particular, Nextflow is a commonly used workflow system, permitting the creation of large workflows while offering substantial flexibility. An increasing number of Nextflow workflows are being shared on repositories such as GitHub. However, this tremendous opportunity to reuse existing code remains largely underutilized. In cause, the increasing complexity of workflows constitute a major obstacle to code reuse. Consequently, there is a rising need for tools that can help bioinformaticians extract valuable information from their own and others' workflows. To facilitate workflow inspection and reuse, we developed BioFlow-Insight to automatically analyze the code of Nextflow workflows and generate useful information, particularly in the form of visual graphs depicting the workflow's structure and representing its individual analysis steps. BioFlow-Insight is an open-source tool, available as both a command-line interface and a web service. It is accessible at https://pypi.org/project/bioflow-insight/ and https://bioflow-insight.pasteur.cloud/.

摘要

生物信息学工作流程越来越多地用于共享分析,是提高生物信息学分析的可重复性和可共享性的基石。特别是,Nextflow是一种常用的工作流程系统,它允许创建大型工作流程,同时提供极大的灵活性。越来越多的Nextflow工作流程在GitHub等代码库上共享。然而,这种重用现有代码的巨大机会在很大程度上仍未得到充分利用。原因在于,工作流程日益复杂构成了代码重用的主要障碍。因此,对能够帮助生物信息学家从他们自己和他人的工作流程中提取有价值信息的工具的需求日益增加。为了促进工作流程检查和重用,我们开发了BioFlow-Insight来自动分析Nextflow工作流程的代码并生成有用信息,特别是以描绘工作流程结构并表示其各个分析步骤的可视化图形的形式。BioFlow-Insight是一个开源工具,有命令行界面和网络服务两种形式。可通过https://pypi.org/project/bioflow-insight/和https://bioflow-insight.pasteur.cloud/访问它。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b827/11302447/d83452017e11/lqae092fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b827/11302447/fc793a21c2db/lqae092fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b827/11302447/d83452017e11/lqae092fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b827/11302447/fc793a21c2db/lqae092fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b827/11302447/d83452017e11/lqae092fig2.jpg

相似文献

1
BioFlow-Insight: facilitating reuse of Nextflow workflows with structure reconstruction and visualization.BioFlow-Insight:通过结构重建和可视化促进Nextflow工作流程的重用。
NAR Genom Bioinform. 2024 Aug 6;6(3):lqae092. doi: 10.1093/nargab/lqae092. eCollection 2024 Sep.
2
Using prototyping to choose a bioinformatics workflow management system.使用原型法选择生物信息学工作流管理系统。
PLoS Comput Biol. 2021 Feb 25;17(2):e1008622. doi: 10.1371/journal.pcbi.1008622. eCollection 2021 Feb.
3
Distilling structure in Taverna scientific workflows: a refactoring approach.Taverna 科学工作流中的结构提取:一种重构方法。
BMC Bioinformatics. 2014;15 Suppl 1(Suppl 1):S12. doi: 10.1186/1471-2105-15-S1-S12. Epub 2014 Jan 10.
4
The Dockstore: enhancing a community platform for sharing reproducible and accessible computational protocols.Dockstore:增强了一个用于共享可重复和可访问的计算协议的社区平台。
Nucleic Acids Res. 2021 Jul 2;49(W1):W624-W632. doi: 10.1093/nar/gkab346.
5
NFTest: automated testing of Nextflow pipelines.NFTest:用于 Nextflow 管道的自动化测试。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae081.
6
Semantic workflows for benchmark challenges: Enhancing comparability, reusability and reproducibility.用于基准挑战的语义工作流:提高可比性、可重用性和可重复性。
Pac Symp Biocomput. 2019;24:208-219.
7
Sapporo: A workflow execution service that encourages the reuse of workflows in various languages in bioinformatics.札幌:一个工作流执行服务,鼓励在生物信息学中重用各种语言的工作流。
F1000Res. 2024 Jun 24;11:889. doi: 10.12688/f1000research.122924.2. eCollection 2022.
8
Utility of the Python package Geoweaver_cwl for improving workflow reusability: an illustration with multidisciplinary use cases.用于提高工作流程可重用性的Python包Geoweaver_cwl的效用:多学科用例说明
Earth Sci Inform. 2023;16(3):2955-2961. doi: 10.1007/s12145-023-01045-0. Epub 2023 Jul 10.
9
Tavaxy: integrating Taverna and Galaxy workflows with cloud computing support.Tavaxy:集成 Taverna 和 Galaxy 工作流并提供云计算支持。
BMC Bioinformatics. 2012 May 4;13:77. doi: 10.1186/1471-2105-13-77.
10
Geniac: Automatic Configuration GENerator and Installer for nextflow pipelines.Geniac:用于Nextflow管道的自动配置生成器与安装程序。
Open Res Eur. 2022 Feb 21;1:76. doi: 10.12688/openreseurope.13861.2. eCollection 2021.

本文引用的文献

1
Ten quick tips for building FAIR workflows.构建FAIR工作流程的十条快速提示。
PLoS Comput Biol. 2023 Sep 28;19(9):e1011369. doi: 10.1371/journal.pcbi.1011369. eCollection 2023 Sep.
2
Developing and reusing bioinformatics data analysis pipelines using scientific workflow systems.使用科学工作流系统开发和重用生物信息学数据分析管道。
Comput Struct Biotechnol J. 2023 Mar 7;21:2075-2085. doi: 10.1016/j.csbj.2023.03.003. eCollection 2023.
3
Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers.
使用生物信息学工作流管理器的可重复、可扩展且可共享的分析管道。
Nat Methods. 2021 Oct;18(10):1161-1168. doi: 10.1038/s41592-021-01254-9. Epub 2021 Sep 23.
4
The nf-core framework for community-curated bioinformatics pipelines.用于社区策划生物信息学流程的nf-core框架。
Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x.
5
Singularity: Scientific containers for mobility of compute.奇点:用于计算移动性的科学容器。
PLoS One. 2017 May 11;12(5):e0177459. doi: 10.1371/journal.pone.0177459. eCollection 2017.
6
Nextflow enables reproducible computational workflows.Nextflow支持可重复的计算工作流程。
Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: 10.1038/nbt.3820.
7
The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update.用于可访问、可重复和协作式生物医学分析的Galaxy平台:2016年更新
Nucleic Acids Res. 2016 Jul 8;44(W1):W3-W10. doi: 10.1093/nar/gkw343. Epub 2016 May 2.
8
Snakemake--a scalable bioinformatics workflow engine.Snakemake——一个可扩展的生物信息学工作流引擎。
Bioinformatics. 2012 Oct 1;28(19):2520-2. doi: 10.1093/bioinformatics/bts480. Epub 2012 Aug 20.
9
myExperiment: a repository and social network for the sharing of bioinformatics workflows.myExperiment:一个用于生物信息学工作流程共享的存储库和社交网络。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W677-82. doi: 10.1093/nar/gkq429. Epub 2010 May 25.
10
Taverna: a tool for the composition and enactment of bioinformatics workflows.Taverna:一种用于生物信息学工作流程的组合与执行的工具。
Bioinformatics. 2004 Nov 22;20(17):3045-54. doi: 10.1093/bioinformatics/bth361. Epub 2004 Jun 16.