• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于云的微生物群落宏基因组分析的生物膜标志物发现。

Biofilm marker discovery with cloud-based dockerized metagenomics analysis of microbial communities.

机构信息

Biomedical Engineering Department, University of South Dakota, 4800 N. Career Ave., Suite 221, Sioux Falls, South Dakota, 57107, United States.

Google Cloud, 1900 Reston Metro Plaza, Reston, Virginia, 20190, United States.

出版信息

Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae429.

DOI:10.1093/bib/bbae429
PMID:39266450
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11392556/
Abstract

In an environment, microbes often work in communities to achieve most of their essential functions, including the production of essential nutrients. Microbial biofilms are communities of microbes that attach to a nonliving or living surface by embedding themselves into a self-secreted matrix of extracellular polymeric substances. These communities work together to enhance their colonization of surfaces, produce essential nutrients, and achieve their essential functions for growth and survival. They often consist of diverse microbes including bacteria, viruses, and fungi. Biofilms play a critical role in influencing plant phenotypes and human microbial infections. Understanding how these biofilms impact plant health, human health, and the environment is important for analyzing genotype-phenotype-driven rule-of-life functions. Such fundamental knowledge can be used to precisely control the growth of biofilms on a given surface. Metagenomics is a powerful tool for analyzing biofilm genomes through function-based gene and protein sequence identification (functional metagenomics) and sequence-based function identification (sequence metagenomics). Metagenomic sequencing enables a comprehensive sampling of all genes in all organisms present within a biofilm sample. However, the complexity of biofilm metagenomic study warrants the increasing need to follow the Findability, Accessibility, Interoperability, and Reusable (FAIR) Guiding Principles for scientific data management. This will ensure that scientific findings can be more easily validated by the research community. This study proposes a dockerized, self-learning bioinformatics workflow to increase the community adoption of metagenomics toolkits in a metagenomics and meta-transcriptomics investigation. Our biofilm metagenomics workflow self-learning module includes integrated learning resources with an interactive dockerized workflow. This module will allow learners to analyze resources that are beneficial for aggregating knowledge about biofilm marker genes, proteins, and metabolic pathways as they define the composition of specific microbial communities. Cloud and dockerized technology can allow novice learners-even those with minimal knowledge in computer science-to use complicated bioinformatics tools. Our cloud-based, dockerized workflow splits biofilm microbiome metagenomics analyses into four easy-to-follow submodules. A variety of tools are built into each submodule. As students navigate these submodules, they learn about each tool used to accomplish the task. The downstream analysis is conducted using processed data obtained from online resources or raw data processed via Nextflow pipelines. This analysis takes place within Vertex AI's Jupyter notebook instance with R and Python kernels. Subsequently, results are stored and visualized in Google Cloud storage buckets, alleviating the computational burden on local resources. The result is a comprehensive tutorial that guides bioinformaticians of any skill level through the entire workflow. It enables them to comprehend and implement the necessary processes involved in this integrated workflow from start to finish. This manuscript describes the development of a resource module that is part of a learning platform named "NIGMS Sandbox for Cloud-based Learning" https://github.com/NIGMS/NIGMS-Sandbox. The overall genesis of the Sandbox is described in the editorial NIGMS Sandbox [1] at the beginning of this Supplement. This module delivers learning materials on the analysis of bulk and single-cell ATAC-seq data in an interactive format that uses appropriate cloud resources for data access and analyses.

摘要

在环境中,微生物通常在群落中工作,以实现其大部分基本功能,包括生产必需的营养物质。微生物生物膜是微生物群落,通过将自身嵌入到细胞外聚合物基质中来附着在无生命或有生命的表面上。这些群落共同努力,增强对表面的定殖,产生必需的营养物质,并实现其生长和生存的基本功能。它们通常由包括细菌、病毒和真菌在内的多种微生物组成。生物膜在影响植物表型和人类微生物感染方面起着关键作用。了解这些生物膜如何影响植物健康、人类健康和环境,对于分析由基因型-表型驱动的生活规则功能非常重要。这种基础知识可用于精确控制给定表面上生物膜的生长。宏基因组学是一种通过基于功能的基因和蛋白质序列鉴定(功能宏基因组学)和基于序列的功能鉴定(序列宏基因组学)来分析生物膜基因组的强大工具。宏基因组测序能够全面采样生物膜样本中所有存在的生物的所有基因。然而,生物膜宏基因组研究的复杂性需要越来越多地遵循可发现性、可访问性、互操作性和可重用性(FAIR)科学数据管理指导原则。这将确保研究结果更容易被研究界验证。本研究提出了一个基于 Docker 的自学习生物信息学工作流程,以增加社区对生物膜宏基因组工具包在宏基因组学和元转录组学研究中的采用。我们的生物膜宏基因组学工作流程自学习模块包括具有交互 Docker 工作流程的集成学习资源。该模块将允许学习者分析有助于聚集有关生物膜标记基因、蛋白质和代谢途径的知识的资源,因为它们定义了特定微生物群落的组成。云技术和 Docker 可以允许初学者(即使是计算机科学知识有限的初学者)使用复杂的生物信息学工具。我们基于云的、基于 Docker 的工作流程将生物膜微生物组宏基因组分析分为四个易于遵循的子模块。每个子模块都内置了各种工具。当学生浏览这些子模块时,他们将了解用于完成任务的每个工具。下游分析使用从在线资源获得的已处理数据或通过 Nextflow 管道处理的原始数据进行。此分析在 Vertex AI 的 Jupyter 笔记本实例中使用 R 和 Python 内核进行。随后,结果存储并在 Google Cloud 存储桶中可视化,减轻了本地资源的计算负担。结果是一个全面的教程,指导任何技能水平的生物信息学家完成整个工作流程。它使他们能够理解并实施从开始到结束的整个集成工作流程中涉及的必要过程。本文描述了名为“NIGMS 基于云的学习沙盒”的学习平台的一部分资源模块的开发,该平台的网址为 https://github.com/NIGMS/NIGMS-Sandbox。该沙盒的总体起源在本增刊开头的 NIGMS 沙盒社论 [1] 中进行了描述。该模块以交互格式提供有关批量和单细胞 ATAC-seq 数据分析的学习材料,该格式使用适当的云资源进行数据访问和分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daf/11392556/bdc531120f21/bbae429f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daf/11392556/df12d4faac7f/bbae429f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daf/11392556/83351b46dca9/bbae429f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daf/11392556/bdc531120f21/bbae429f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daf/11392556/df12d4faac7f/bbae429f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daf/11392556/83351b46dca9/bbae429f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daf/11392556/bdc531120f21/bbae429f3.jpg

相似文献

1
Biofilm marker discovery with cloud-based dockerized metagenomics analysis of microbial communities.基于云的微生物群落宏基因组分析的生物膜标志物发现。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae429.
2
Transcriptomics and epigenetic data integration learning module on Google Cloud.转录组学和表观遗传学数据集成学习模块在谷歌云上。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae352.
3
Whole-genome bisulfite sequencing data analysis learning module on Google Cloud Platform.基于谷歌云平台的全基因组亚硫酸氢盐测序数据分析学习模块。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae236.
4
A cloud-based learning module for biomarker discovery.基于云的生物标志物发现学习模块。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae126.
5
Understanding proteome quantification in an interactive learning module on Google Cloud Platform.理解蛋白质组定量在 Google Cloud Platform 上的交互式学习模块中的应用。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae235.
6
Cloud-based introduction to BASH programming for biologists.基于云的生物学 BASH 编程入门。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae244.
7
CCPA: cloud-based, self-learning modules for consensus pathway analysis using GO, KEGG and Reactome.CCPA:基于云的、使用 GO、KEGG 和 Reactome 的自学习共识途径分析模块。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae222.
8
Reusable tutorials for using cloud-based computing environments for the analysis of bacterial gene expression data from bulk RNA sequencing.可重复使用的教程,用于使用基于云的计算环境分析来自批量 RNA 测序的细菌基因表达数据。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae301.
9
Identifying and training deep learning neural networks on biomedical-related datasets.在生物医学相关数据集上识别和训练深度学习神经网络。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae232.
10
CloudATAC: a cloud-based framework for ATAC-Seq data analysis.CloudATAC:一个基于云的 ATAC-Seq 数据分析框架。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae090.

引用本文的文献

1
NIGMS Sandbox: a learning platform toward democratizing cloud computing for biomedical research.NIGMS 沙盒:一个学习平台,旨在为生物医学研究普及云计算。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae478.

本文引用的文献

1
Omics approaches in bioremediation of environmental contaminants: An integrated approach for environmental safety and sustainability.组学方法在环境污染物生物修复中的应用:环境安全和可持续性的综合方法。
Environ Res. 2022 Aug;211:113102. doi: 10.1016/j.envres.2022.113102. Epub 2022 Mar 14.
2
Trimming and Validation of Illumina Short Reads Using Trimmomatic, Trinity Assembly, and Assessment of RNA-Seq Data.使用 Trimmomatic、Trinity 组装进行 Illumina 短读段的修剪和验证,以及 RNA-Seq 数据的评估。
Methods Mol Biol. 2022;2443:211-232. doi: 10.1007/978-1-0716-2067-0_11.
3
MetaLAFFA: a flexible, end-to-end, distributed computing-compatible metagenomic functional annotation pipeline.
MetaLAFFA:一个灵活的、端到端的、分布式计算兼容的宏基因组功能注释管道。
BMC Bioinformatics. 2020 Oct 21;21(1):471. doi: 10.1186/s12859-020-03815-9.
4
GMM-Demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing.GMM-Demux:单细胞测序中的样品分拆、多重检测、实验规划和新型细胞类型验证。
Genome Biol. 2020 Jul 30;21(1):188. doi: 10.1186/s13059-020-02084-2.
5
PICRUSt2 for prediction of metagenome functions.用于宏基因组功能预测的PICRUSt2
Nat Biotechnol. 2020 Jun;38(6):685-688. doi: 10.1038/s41587-020-0548-6.
6
The Biofilms Structural Database.生物膜结构数据库。
Trends Biotechnol. 2020 Sep;38(9):937-940. doi: 10.1016/j.tibtech.2020.04.002. Epub 2020 May 5.
7
QIIME 2 Enables Comprehensive End-to-End Analysis of Diverse Microbiome Data and Comparative Studies with Publicly Available Data.QIIME 2 支持对多样化微生物组数据进行全面的端到端分析,并与公共可用数据进行比较研究。
Curr Protoc Bioinformatics. 2020 Jun;70(1):e100. doi: 10.1002/cpbi.100.
8
Statistical Analysis of Metagenomics Data.宏基因组学数据的统计分析
Genomics Inform. 2019 Mar;17(1):e6. doi: 10.5808/GI.2019.17.1.e6. Epub 2019 Mar 31.
9
16S rRNA Gene Analysis with QIIME2.使用QIIME2进行16S rRNA基因分析。
Methods Mol Biol. 2018;1849:113-129. doi: 10.1007/978-1-4939-8728-3_8.
10
Bacterial Quorum Sensing and Microbial Community Interactions.细菌群体感应与微生物群落相互作用。
mBio. 2018 May 22;9(3):e02331-17. doi: 10.1128/mBio.02331-17.