Suppr
超能文献

ASAP 2：一个用于自动和一致地分析标记基因扩增子测序数据的流水线和网络服务器。

ASAP 2: a pipeline and web server to analyze marker gene amplicon sequencing data automatically and consistently.

机构信息

Institute for Food Safety and Health, Illinois Institute of Technology, Bedford Park, IL, 60501, USA.

Department of Food Science and Nutrition, Illinois Institute of Technology, Bedford Park, IL, 60501, USA.

出版信息

BMC Bioinformatics. 2022 Jan 6;23(1):27. doi: 10.1186/s12859-021-04555-0.

DOI:10.1186/s12859-021-04555-0

PMID:34991446

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8740450/

Abstract

BACKGROUND

Amplicon sequencing of marker genes such as 16S rDNA have been widely used to survey and characterize microbial community. However, the complex data analyses have required many interfering manual steps often leading to inconsistencies in results.

RESULTS

Here, we have developed a pipeline, amplicon sequence analysis pipeline 2 (ASAP 2), to automate and glide through the processes without the usual manual inspections and user's interference, for instance, in the detection of barcode orientation, selection of high-quality region of reads, and determination of resampling depth and many more. The pipeline integrates all the analytical processes such as importing data, demultiplexing, summarizing read profiles, trimming quality, denoising, removing chimeric sequences and making the feature table among others. The pipeline accepts multiple file formats as input including multiplexed or demultiplexed, paired-end or single-end, barcode inside or outside and raw or intermediate data (e.g. feature table). The outputs include taxonomic classification, alpha/beta diversity, community composition, ordination analysis and statistical tests. ASAP 2 supports merging multiple sequencing runs which helps integrate and compare data from different sources (public databases and collaborators).

CONCLUSIONS

Our pipeline minimizes hands-on interference and runs amplicon sequence variant (ASV)-based amplicon sequencing analysis automatically and consistently. Our web server assists researchers that have no access to high performance computer (HPC) or have limited bioinformatics skills. The pipeline and web server can be accessed at https://github.com/tianrenmaogithub/asap2 and https://hts.iit.edu/asap2 , respectively.

摘要

背景

扩增子测序技术（如 16S rDNA）已广泛用于微生物群落的调查和特征分析。然而，复杂的数据分析需要许多干预性的手动步骤，这往往导致结果不一致。

结果

在这里，我们开发了一个流程，即扩增子序列分析流程 2（ASAP 2），该流程可以自动完成所有步骤，而无需通常的手动检查和用户干预，例如检测条形码方向、选择高质量的读段、确定重采样深度等。该流程集成了所有分析过程，如导入数据、多路分解、读取概况汇总、质量修剪、去噪、去除嵌合体序列以及制作特征表等。该流程接受多种文件格式作为输入，包括多路或多路分解、成对或单端、条形码在内部或外部以及原始或中间数据（例如特征表）。输出包括分类学分类、α/β多样性、群落组成、排序分析和统计检验。ASAP 2 支持合并多个测序运行，这有助于整合和比较来自不同来源的数据（公共数据库和合作者）。

结论

我们的流程最大限度地减少了手工干预，并自动一致地运行基于扩增子序列变异（ASV）的扩增子测序分析。我们的网络服务器可以帮助那些无法访问高性能计算机（HPC）或具有有限生物信息学技能的研究人员。该流程和网络服务器可以分别在 https://github.com/tianrenmaogithub/asap2 和 https://hts.iit.edu/asap2 上访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/92e1/8740450/8cbc37cefbfb/12859_2021_4555_Fig1_HTML.jpg

相似文献

ASAP 2: a pipeline and web server to analyze marker gene amplicon sequencing data automatically and consistently.

BMC Bioinformatics. 2022 Jan 6;23(1):27. doi: 10.1186/s12859-021-04555-0.

Systematic processing of ribosomal RNA gene amplicon sequencing data.

Gigascience. 2019 Dec 1;8(12). doi: 10.1093/gigascience/giz146.

Dadasnake, a Snakemake implementation of DADA2 to process amplicon sequencing data for microbial ecology.

Gigascience. 2020 Nov 30;9(12). doi: 10.1093/gigascience/giaa135.

Dadaist2: A Toolkit to Automate and Simplify Statistical Analysis and Plotting of Metabarcoding Experiments.

Int J Mol Sci. 2021 May 18;22(10):5309. doi: 10.3390/ijms22105309.

MeFiT: merging and filtering tool for illumina paired-end reads for 16S rRNA amplicon sequencing.

BMC Bioinformatics. 2016 Dec 1;17(1):491. doi: 10.1186/s12859-016-1358-1.

LotuS2: an ultrafast and highly accurate tool for amplicon sequencing analysis.

Microbiome. 2022 Oct 19;10(1):176. doi: 10.1186/s40168-022-01365-1.

From reads to operational taxonomic units: an ensemble processing pipeline for MiSeq amplicon sequencing data.

Gigascience. 2017 Feb 1;6(2):1-10. doi: 10.1093/gigascience/giw017.

StreamingTrim 1.0: a Java software for dynamic trimming of 16S rRNA sequence data from metagenetic studies.

Mol Ecol Resour. 2014 Mar;14(2):426-34. doi: 10.1111/1755-0998.12187. Epub 2013 Nov 16.

Concatenation of paired-end reads improves taxonomic classification of amplicons for profiling microbial communities.

BMC Bioinformatics. 2021 Oct 12;22(1):493. doi: 10.1186/s12859-021-04410-2.

: A Scalable and Versatile Amplicon Sequence Data Analysis Pipeline Delivering Reproducible and Documented Results.

Front Genet. 2020 Nov 20;11:489357. doi: 10.3389/fgene.2020.489357. eCollection 2020.

引用本文的文献

The Maleth program: Malta's first space mission discoveries on the microbiome of diabetic foot ulcers.

Heliyon. 2022 Dec 5;8(12):e12075. doi: 10.1016/j.heliyon.2022.e12075. eCollection 2022 Dec.

Tourmaline: A containerized workflow for rapid and iterable amplicon sequence analysis using QIIME 2 and Snakemake.

Gigascience. 2022 Jul 28;11. doi: 10.1093/gigascience/giac066.

Microbial Richness of Marine Biofilms Revealed by Sequencing Full-Length 16S rRNA Genes.

Genes (Basel). 2022 Jun 12;13(6):1050. doi: 10.3390/genes13061050.

本文引用的文献

Fishing in the Soup - Pathogen Detection in Food Safety Using Metabarcoding and Metagenomic Sequencing.

Front Microbiol. 2019 Aug 6;10:1805. doi: 10.3389/fmicb.2019.01805. eCollection 2019.

Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2.

Nat Biotechnol. 2019 Aug;37(8):852-857. doi: 10.1038/s41587-019-0209-9.

Diazotroph Community Characterization via a High-Throughput Amplicon Sequencing and Analysis Pipeline.

Appl Environ Microbiol. 2018 Jan 31;84(4). doi: 10.1128/AEM.01512-17. Print 2018 Feb 15.

Multiple identification of most important waterborne protozoa in surface water used for irrigation purposes by 18S rRNA amplicon-based metagenomics.

Int J Hyg Environ Health. 2018 Jan;221(1):102-111. doi: 10.1016/j.ijheh.2017.10.008. Epub 2017 Oct 19.

Analysing Microbial Community Composition through Amplicon Sequencing: From Sampling to Hypothesis Testing.

Front Microbiol. 2017 Sep 4;8:1561. doi: 10.3389/fmicb.2017.01561. eCollection 2017.

Fast and Simple Analysis of MiSeq Amplicon Sequencing Data with MetaAmp.

Front Microbiol. 2017 Aug 3;8:1461. doi: 10.3389/fmicb.2017.01461. eCollection 2017.

Diversity and Composition of Sulfate-Reducing Microbial Communities Based on Genomic DNA and RNA Transcription in Production Water of High Temperature and Corrosive Oil Reservoir.

Front Microbiol. 2017 Jun 7;8:1011. doi: 10.3389/fmicb.2017.01011. eCollection 2017.

16S rRNA gene sequencing and healthy reference ranges for 28 clinically relevant microbial taxa from the human gut microbiome.

PLoS One. 2017 May 3;12(5):e0176555. doi: 10.1371/journal.pone.0176555. eCollection 2017.

Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns.

mSystems. 2017 Mar 7;2(2). doi: 10.1128/mSystems.00191-16. eCollection 2017 Mar-Apr.

Accurate Estimation of Fungal Diversity and Abundance through Improved Lineage-Specific Primers Optimized for Illumina Amplicon Sequencing.

Appl Environ Microbiol. 2016 Nov 21;82(24):7217-7226. doi: 10.1128/AEM.02576-16. Print 2016 Dec 15.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

ASAP 2：一个用于自动和一致地分析标记基因扩增子测序数据的流水线和网络服务器。

ASAP 2: a pipeline and web server to analyze marker gene amplicon sequencing data automatically and consistently.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译