Department of Biomedical Science and Engineering, Konkuk University, Seoul, 05029, Republic of Korea.
BMC Bioinformatics. 2020 May 12;21(1):185. doi: 10.1186/s12859-020-3533-7.
Microorganisms are important occupants of many different environments. Identifying the composition of microbes and estimating their abundance promote understanding of interactions of microbes in environmental samples. To understand their environments more deeply, the composition of microorganisms in environmental samples has been studied using metagenomes, which are the collections of genomes of the microorganisms. Although many tools have been developed for taxonomy analysis based on different algorithms, variability of analysis outputs of existing tools from the same input metagenome datasets is the main obstacle for many researchers in this field.
Here, we present a novel meta-analysis tool for metagenome taxonomy analysis, called TAMA, by intelligently integrating outputs from three different taxonomy analysis tools. Using an integrated reference database, TAMA performs taxonomy assignment for input metagenome reads based on a meta-score by integrating scores of taxonomy assignment from different taxonomy classification tools. TAMA outperformed existing tools when evaluated using various benchmark datasets. It was also successfully applied to obtain relative species abundance profiles and difference in composition of microorganisms in two types of cheese metagenome and human gut metagenome.
TAMA can be easily installed and used for metagenome read classification and the prediction of relative species abundance from multiple numbers and types of metagenome read samples. TAMA can be used to more accurately uncover the composition of microorganisms in metagenome samples collected from various environments, especially when the use of a single taxonomy analysis tool is unreliable. TAMA is an open source tool, and can be downloaded at https://github.com/jkimlab/TAMA.
微生物是许多不同环境中的重要居住者。鉴定微生物的组成并估计其丰度有助于了解环境样本中微生物的相互作用。为了更深入地了解它们的环境,已经使用宏基因组学(即微生物基因组的集合)来研究环境样本中微生物的组成。尽管已经开发了许多基于不同算法的分类分析工具,但来自同一宏基因组数据集的现有工具的分析输出的可变性是该领域许多研究人员的主要障碍。
在这里,我们通过智能地整合三个不同分类分析工具的输出,提出了一种用于宏基因组分类分析的新的元分析工具,称为 TAMA。使用集成参考数据库,TAMA 根据来自不同分类分类工具的分类分配分数的元分数,对输入宏基因组读数进行分类分配。使用各种基准数据集评估时,TAMA 优于现有工具。它还成功地应用于从两种奶酪宏基因组和人类肠道宏基因组中获得相对物种丰度分布和微生物组成的差异。
TAMA 可以轻松安装并用于对来自多个数量和类型的宏基因组读样本进行宏基因组读分类和相对物种丰度的预测。TAMA 可用于更准确地揭示来自各种环境的宏基因组样本中微生物的组成,特别是当使用单个分类分析工具不可靠时。TAMA 是一个开源工具,可以在 https://github.com/jkimlab/TAMA 上下载。