通过NMDC EDGE资源实现标准化且可访问的多组学生物信息学工作流程。

Standardized and accessible multi-omics bioinformatics workflows through the NMDC EDGE resource.

作者信息

Kelliher Julia M, Xu Yan, Flynn Mark C, Babinski Michal, Canon Shane, Cavanna Eric, Clum Alicia, Corilo Yuri E, Fujimoto Grant, Giberson Cameron, Johnson Leah Y D, Li Kaitlyn J, Li Po-E, Li Valerie, Lo Chien-Chi, Lynch Wendi, Piehowski Paul, Prime Kaelan, Purvine Samuel, Rodriguez Francisca, Roux Simon, Shakya Migun, Smith Montana, Sarrafan Setareh, Cholia Shreyas, McCue Lee Ann, Mungall Chris, Hu Bin, Eloe-Fadrosh Emiley A, Chain Patrick S G

机构信息

Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA.

Environmental Genomics & Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.

出版信息

Comput Struct Biotechnol J. 2024 Sep 27;23:3575-3583. doi: 10.1016/j.csbj.2024.09.018. eCollection 2024 Dec.

DOI:10.1016/j.csbj.2024.09.018

PMID:39963423

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11832004/

Abstract

Accessible and easy-to-use standardized bioinformatics workflows are necessary to advance microbiome research from observational studies to large-scale, data-driven approaches. Standardized multi-omics data enables comparative studies, data reuse, and applications of machine learning to model biological processes. To advance broad accessibility of standardized multi-omics bioinformatics workflows, the National Microbiome Data Collaborative (NMDC) has developed the Empowering the Development of Genomics Expertise (NMDC EDGE) resource, a user-friendly, open-source web application (https://nmdc-edge.org). Here, we describe the design and main functionality of the NMDC EDGE resource for processing metagenome, metatranscriptome, natural organic matter, and metaproteome data. The architecture relies on three main layers (web application, orchestration, and execution) to ensure flexibility and expansion to future workflows. The orchestration and execution layers leverage best practices in software containers and accommodate high-performance computing and cloud computing services. Further, we have adopted a robust user research process to collect feedback for continuous improvement of the resource. NMDC EDGE provides an accessible interface for researchers to process multi-omics microbiome data using production-quality workflows to facilitate improved data standardization and interoperability.

摘要

为了将微生物组研究从观察性研究推进到大规模、数据驱动的方法，需要有可访问且易于使用的标准化生物信息学工作流程。标准化的多组学数据能够实现比较研究、数据重用以及应用机器学习对生物过程进行建模。为了提高标准化多组学生物信息学工作流程的广泛可及性，国家微生物组数据协作组织（NMDC）开发了“增强基因组学专业知识发展”（NMDC EDGE）资源，这是一个用户友好的开源网络应用程序（https://nmdc-edge.org）。在此，我们描述了NMDC EDGE资源用于处理宏基因组、宏转录组、天然有机物和宏蛋白质组数据的设计和主要功能。该架构依赖于三个主要层次（网络应用程序、编排和执行），以确保灵活性并能扩展到未来的工作流程。编排层和执行层利用了软件容器中的最佳实践，并适应高性能计算和云计算服务。此外，我们采用了稳健的用户研究流程来收集反馈，以便对资源进行持续改进。NMDC EDGE为研究人员提供了一个可访问的界面，使他们能够使用高质量的生产工作流程来处理多组学微生物组数据，以促进提高数据标准化和互操作性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6683/11832004/b4ebf9573543/gr1.jpg

相似文献

Standardized and accessible multi-omics bioinformatics workflows through the NMDC EDGE resource.

Comput Struct Biotechnol J. 2024 Sep 27;23:3575-3583. doi: 10.1016/j.csbj.2024.09.018. eCollection 2024 Dec.

Challenges in Bioinformatics Workflows for Processing Microbiome Omics Data at Scale.

Front Bioinform. 2022 Jan 17;1:826370. doi: 10.3389/fbinf.2021.826370. eCollection 2021.

AnVILWorkflow: A runnable workflow package for Cloud-implemented bioinformatics analysis pipelines.

F1000Res. 2024 Oct 21;13:1257. doi: 10.12688/f1000research.155449.1. eCollection 2024.

Tavaxy: integrating Taverna and Galaxy workflows with cloud computing support.

BMC Bioinformatics. 2012 May 4;13:77. doi: 10.1186/1471-2105-13-77.

caTissue Suite to OpenSpecimen: Developing an extensible, open source, web-based biobanking management system.

J Biomed Inform. 2015 Oct;57:456-64. doi: 10.1016/j.jbi.2015.08.020. Epub 2015 Aug 29.

PhenoMeNal: processing and analysis of metabolomics data in the cloud.

Gigascience. 2019 Feb 1;8(2). doi: 10.1093/gigascience/giy149.

gcMeta: a Global Catalogue of Metagenomics platform to support the archiving, standardization and analysis of microbiome data.

Nucleic Acids Res. 2019 Jan 8;47(D1):D637-D648. doi: 10.1093/nar/gky1008.

VDJServer: A Cloud-Based Analysis Portal and Data Commons for Immune Repertoire Sequences and Rearrangements.

Front Immunol. 2018 May 8;9:976. doi: 10.3389/fimmu.2018.00976. eCollection 2018.

AnVILWorkflow: A runnable workflow package for Cloud-implemented bioinformatics analysis pipelines.

Res Sq. 2024 May 15:rs.3.rs-4370115. doi: 10.21203/rs.3.rs-4370115/v1.

MicrobiomeStatPlots: Microbiome statistics plotting gallery for meta-omics and bioinformatics.

Imeta. 2025 Feb 17;4(1):e70002. doi: 10.1002/imt2.70002. eCollection 2025 Feb.

引用本文的文献

The Future of a Myriad of Accelerated Biodiscoveries Lies in AI-Powered Mass Spectrometry and Multiomics Integration.

J Mass Spectrom. 2025 Aug;60(8):e5157. doi: 10.1002/jms.5157.

A cost and community perspective on the barriers to microbiome data reuse.

Front Bioinform. 2025 Apr 9;5:1585717. doi: 10.3389/fbinf.2025.1585717. eCollection 2025.

Quantifying the impact of workshops promoting microbiome data standards and data stewardship.

Sci Rep. 2025 Mar 22;15(1):9887. doi: 10.1038/s41598-025-89991-1.

本文引用的文献

Unveiling microbial diversity: harnessing long-read sequencing technology.

Nat Methods. 2024 Jun;21(6):954-966. doi: 10.1038/s41592-024-02262-1. Epub 2024 Apr 30.

CyVerse: Cyberinfrastructure for open science.

PLoS Comput Biol. 2024 Feb 7;20(2):e1011270. doi: 10.1371/journal.pcbi.1011270. eCollection 2024 Feb.

Identification of mobile genetic elements with geNomad.

Nat Biotechnol. 2024 Aug;42(8):1303-1312. doi: 10.1038/s41587-023-01953-y. Epub 2023 Sep 21.

The IMG/M data management and analysis system v.7: content updates and new features.

Nucleic Acids Res. 2023 Jan 6;51(D1):D723-D732. doi: 10.1093/nar/gkac976.

Challenges in Bioinformatics Workflows for Processing Microbiome Omics Data at Scale.

Front Bioinform. 2022 Jan 17;1:826370. doi: 10.3389/fbinf.2021.826370. eCollection 2021.

Short- and long-read metagenomics expand individualized structural variations in gut microbiomes.

Nat Commun. 2022 Jun 8;13(1):3175. doi: 10.1038/s41467-022-30857-9.

EDGE COVID-19: a web platform to generate submission-ready genomes from SARS-CoV-2 sequencing efforts.

Bioinformatics. 2022 May 13;38(10):2700-2704. doi: 10.1093/bioinformatics/btac176.

Finding the right fit: evaluation of short-read and long-read sequencing approaches to maximize the utility of clinical microbiome data.

Microb Genom. 2022 Mar;8(3). doi: 10.1099/mgen.0.000794.

The Sequence Read Archive: a decade more of explosive growth.

Nucleic Acids Res. 2022 Jan 7;50(D1):D387-D390. doi: 10.1093/nar/gkab1053.

DOE JGI Metagenome Workflow.

mSystems. 2021 May 18;6(3):e00804-20. doi: 10.1128/mSystems.00804-20.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过NMDC EDGE资源实现标准化且可访问的多组学生物信息学工作流程。

Standardized and accessible multi-omics bioinformatics workflows through the NMDC EDGE resource.

作者信息

机构信息

Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA.

Environmental Genomics & Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.

出版信息

Comput Struct Biotechnol J. 2024 Sep 27;23:3575-3583. doi: 10.1016/j.csbj.2024.09.018. eCollection 2024 Dec.

DOI:10.1016/j.csbj.2024.09.018

PMID:39963423

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11832004/

Abstract

摘要

通过NMDC EDGE资源实现标准化且可访问的多组学生物信息学工作流程。

Standardized and accessible multi-omics bioinformatics workflows through the NMDC EDGE resource.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

通过NMDC EDGE资源实现标准化且可访问的多组学生物信息学工作流程。

Standardized and accessible multi-omics bioinformatics workflows through the NMDC EDGE resource.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献