Suppr超能文献

MetaMeta:整合宏基因组分析工具以改善分类剖析。

MetaMeta: integrating metagenome analysis tools to improve taxonomic profiling.

机构信息

Research Group Bioinformatics (NG4), Robert Koch Institute, Nordufer 20, Berlin, 13353, Germany.

CAPES Foundation, Ministry of Education of Brazil, Brasília, 70040-020, DF, Brazil.

出版信息

Microbiome. 2017 Aug 14;5(1):101. doi: 10.1186/s40168-017-0318-y.

Abstract

BACKGROUND

Many metagenome analysis tools are presently available to classify sequences and profile environmental samples. In particular, taxonomic profiling and binning methods are commonly used for such tasks. Tools available among these two categories make use of several techniques, e.g., read mapping, k-mer alignment, and composition analysis. Variations on the construction of the corresponding reference sequence databases are also common. In addition, different tools provide good results in different datasets and configurations. All this variation creates a complicated scenario to researchers to decide which methods to use. Installation, configuration and execution can also be difficult especially when dealing with multiple datasets and tools.

RESULTS

We propose MetaMeta: a pipeline to execute and integrate results from metagenome analysis tools. MetaMeta provides an easy workflow to run multiple tools with multiple samples, producing a single enhanced output profile for each sample. MetaMeta includes a database generation, pre-processing, execution, and integration steps, allowing easy execution and parallelization. The integration relies on the co-occurrence of organisms from different methods as the main feature to improve community profiling while accounting for differences in their databases.

CONCLUSIONS

In a controlled case with simulated and real data, we show that the integrated profiles of MetaMeta overcome the best single profile. Using the same input data, it provides more sensitive and reliable results with the presence of each organism being supported by several methods. MetaMeta uses Snakemake and has six pre-configured tools, all available at BioConda channel for easy installation (conda install -c bioconda metameta). The MetaMeta pipeline is open-source and can be downloaded at: https://gitlab.com/rki_bioinformatics .

摘要

背景

目前有许多宏基因组分析工具可用于对序列进行分类并对环境样本进行分析。特别是,分类分析和分类学方法常用于此类任务。这两类工具都使用了多种技术,例如读取映射、k-mer 比对和组成分析。相应参考序列数据库构建的变化也很常见。此外,不同的工具在不同的数据集和配置下提供了良好的结果。所有这些变化使得研究人员很难决定使用哪些方法。安装、配置和执行也可能很困难,特别是在处理多个数据集和工具时。

结果

我们提出了 MetaMeta:一个用于执行和整合宏基因组分析工具结果的管道。MetaMeta 提供了一个简单的工作流程,用于对多个样本运行多个工具,为每个样本生成单个增强的输出概况。MetaMeta 包括数据库生成、预处理、执行和集成步骤,允许轻松执行和并行化。该集成依赖于不同方法中生物体的共现作为主要特征,以提高群落分析,同时考虑到它们的数据库差异。

结论

在一个带有模拟和真实数据的受控案例中,我们表明 MetaMeta 的集成概况优于最佳单一概况。使用相同的输入数据,它提供了更敏感和可靠的结果,每个生物体的存在都得到了几种方法的支持。MetaMeta 使用 Snakemake 并具有六个预配置的工具,所有工具均可在 BioConda 频道中轻松安装(conda install -c bioconda metameta)。MetaMeta 管道是开源的,可以在以下网址下载:https://gitlab.com/rki_bioinformatics

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/78d8/5557516/1cac73660926/40168_2017_318_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验