MetaMeta: integrating metagenome analysis tools to improve taxonomic profiling.

Affiliations

Research Group Bioinformatics (NG4), Robert Koch Institute, Nordufer 20, Berlin, 13353, Germany.

CAPES Foundation, Ministry of Education of Brazil, Brasília, 70040-020, DF, Brazil.

Publication information

Microbiome. 2017 Aug 14;5(1):101. doi: 10.1186/s40168-017-0318-y.

DOI:10.1186/s40168-017-0318-y

PMID:28807044

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5557516/

Abstract

BACKGROUND

Many metagenome analysis tools are currently available to classify sequences and profile environmental samples. In particular, taxonomic profiling and binning methods are commonly used for such tasks. Tools in these two categories rely on several techniques, e.g., read mapping, k-mer alignment, and composition analysis. The corresponding reference sequence databases are also constructed in varying ways. In addition, different tools perform well on different datasets and configurations. All this variation makes it complicated for researchers to decide which methods to use. Installation, configuration, and execution can also be difficult, especially when dealing with multiple datasets and tools.

RESULTS

We propose MetaMeta, a pipeline to execute and integrate results from metagenome analysis tools. MetaMeta provides an easy workflow to run multiple tools on multiple samples, producing a single enhanced output profile for each sample. MetaMeta includes database generation, pre-processing, execution, and integration steps, allowing easy execution and parallelization. The integration relies on the co-occurrence of organisms across different methods as the main feature to improve community profiling, while accounting for differences in their databases.
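To make the co-occurrence idea concrete, the following is a minimal sketch and not MetaMeta's actual integration algorithm, which the abstract does not specify. It assumes each tool's profile is a mapping from taxon to relative abundance and that the set of taxa in each tool's database is known. The function name `merge_profiles`, the `min_support` threshold, the averaging rule, and the toy tool and taxon names are all hypothetical.

```python
from typing import Dict, Set

Profile = Dict[str, float]  # taxon name -> relative abundance


def merge_profiles(
    profiles: Dict[str, Profile],
    databases: Dict[str, Set[str]],
    min_support: float = 0.5,
) -> Profile:
    """Merge per-tool profiles by co-occurrence (illustrative assumption only)."""
    merged: Profile = {}
    taxa = set().union(*(p.keys() for p in profiles.values()))
    for taxon in taxa:
        # Tools that could have reported this taxon (it is in their database).
        able = [t for t in profiles if taxon in databases[t]]
        # Tools that actually reported it.
        found = [t for t in able if taxon in profiles[t]]
        # Keep a taxon only if enough of the capable tools agree on it.
        if not able or len(found) / len(able) < min_support:
            continue
        merged[taxon] = sum(profiles[t][taxon] for t in found) / len(found)
    # Renormalize so the merged abundances sum to 1.
    total = sum(merged.values())
    return {taxon: value / total for taxon, value in merged.items()} if total else {}


# Toy input with made-up tools, taxa, and abundances.
profiles = {
    "tool_1": {"taxon_A": 0.6, "taxon_B": 0.4},
    "tool_2": {"taxon_A": 0.7, "taxon_C": 0.3},
    "tool_3": {"taxon_A": 0.5, "taxon_B": 0.5},
}
databases = {
    "tool_1": {"taxon_A", "taxon_B", "taxon_C"},
    "tool_2": {"taxon_A", "taxon_C"},  # taxon_B is absent from this database
    "tool_3": {"taxon_A", "taxon_B", "taxon_C"},
}
result = merge_profiles(profiles, databases)
```

In this toy case `taxon_B` is kept even though `tool_2` never reports it, because `tool_2` could not have: the taxon is missing from its database. `taxon_C` is dropped because only one of the three tools able to detect it did so.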

CONCLUSIONS

In a controlled case with simulated and real data, we show that the integrated profiles of MetaMeta outperform the best single profile. Using the same input data, it provides more sensitive and reliable results, with the presence of each organism supported by several methods. MetaMeta uses Snakemake and has six pre-configured tools, all available through the BioConda channel for easy installation (conda install -c bioconda metameta). The MetaMeta pipeline is open-source and can be downloaded at: https://gitlab.com/rki_bioinformatics .



