
Cascabel: A Scalable and Versatile Amplicon Sequence Data Analysis Pipeline Delivering Reproducible and Documented Results.

Authors

Abdala Asbun Alejandro, Besseling Marc A, Balzano Sergio, van Bleijswijk Judith D L, Witte Harry J, Villanueva Laura, Engelmann Julia C

Affiliations

Department of Marine Microbiology and Biogeochemistry, NIOZ Royal Netherlands Institute for Sea Research, Texel, Netherlands.

Department of Earth Sciences, Faculty of Geosciences, Utrecht University, Utrecht, Netherlands.

Publication

Front Genet. 2020 Nov 20;11:489357. doi: 10.3389/fgene.2020.489357. eCollection 2020.

DOI: 10.3389/fgene.2020.489357
PMID: 33329686
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7718033/
Abstract

Marker gene sequencing of the rRNA operon (16S, 18S, ITS) or cytochrome c oxidase I (CO1) is a popular means to assess microbial communities of the environment, microbiomes associated with plants and animals, as well as communities of multicellular organisms via environmental DNA sequencing. Since this technique is based on sequencing a single gene, or even only parts of a single gene rather than the entire genome, the number of reads needed per sample to assess the microbial community structure is lower than that required for metagenome sequencing. This makes marker gene sequencing affordable to nearly any laboratory. Despite the relative ease and cost-efficiency of data generation, analyzing the resulting sequence data requires computational skills that may go beyond the standard repertoire of a current molecular biologist/ecologist. We have developed Cascabel, a scalable, flexible, and easy-to-use amplicon sequence data analysis pipeline, which uses Snakemake and a combination of existing and newly developed solutions for its computational steps. Cascabel takes the raw data as input and delivers a table of operational taxonomic units (OTUs) or Amplicon Sequence Variants (ASVs) in BIOM and text format and representative sequences. Cascabel is a highly versatile software that allows users to customize several steps of the pipeline, such as selecting from a set of OTU clustering methods or performing ASV analysis. In addition, we designed Cascabel to run in any Linux/Unix computing environment from desktop computers to computing servers, making use of parallel processing if possible. The analyses and results are fully reproducible and documented in an HTML and optional PDF report. Cascabel is freely available at GitHub: https://github.com/AlejandroAb/CASCABEL.
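
To make the pipeline's main deliverable concrete, the following is a minimal sketch of how a text-format OTU table can be assembled from per-read OTU assignments. It is not taken from the pipeline's code: the function names `build_otu_table` and `to_tsv`, the toy sample and OTU identifiers, and the `#OTU ID` header (a layout commonly used for tab-delimited OTU tables) are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def build_otu_table(assignments):
    """Count reads per (OTU, sample).

    assignments: iterable of (sample_id, otu_id) pairs, one pair per read.
    Returns a mapping otu_id -> Counter(sample_id -> read count) and the
    sorted list of sample identifiers.
    """
    counts = defaultdict(Counter)
    samples = set()
    for sample_id, otu_id in assignments:
        counts[otu_id][sample_id] += 1
        samples.add(sample_id)
    return counts, sorted(samples)


def to_tsv(counts, samples):
    """Render the counts as a tab-delimited table: one row per OTU,
    one column per sample (hypothetical layout, see lead-in)."""
    lines = ["#OTU ID\t" + "\t".join(samples)]
    for otu_id in sorted(counts):
        # A Counter returns 0 for samples in which the OTU was not observed.
        row = [str(counts[otu_id][s]) for s in samples]
        lines.append(otu_id + "\t" + "\t".join(row))
    return "\n".join(lines)


# Hypothetical toy input: five reads from two samples.
reads = [
    ("S1", "OTU_1"),
    ("S1", "OTU_1"),
    ("S1", "OTU_2"),
    ("S2", "OTU_2"),
    ("S2", "OTU_3"),
]
counts, samples = build_otu_table(reads)
table_tsv = to_tsv(counts, samples)
print(table_tsv)
```

In a real run the assignments would come from an OTU clustering or ASV inference step rather than a hand-written list, and the same count matrix could also be serialized to BIOM.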


