Graduate School of Life and Environmental Sciences, Kyoto Prefectural University, Kyoto, Japan.
Faculty of Life and Environmental Sciences, Kyoto Prefectural University, Kyoto, Japan.
PLoS One. 2023 Feb 2;18(2):e0281288. doi: 10.1371/journal.pone.0281288. eCollection 2023.
Environmental factors affect the growth of microorganisms and therefore alter the composition of microbiota. Correlative analysis of the relationship between metagenomic composition and the environmental gradient can help elucidate key environmental factors and establishment principles for microbial communities. However, a reasonable method to quantitatively compare whole metagenomic data and identify the primary environmental factors for the establishment of microbiota has not been reported so far. In this study, we developed a method to compare whole proteomes deduced from metagenomic shotgun sequencing data, and quantitatively display their phylogenetic relationships as metagenomic trees. We called this method Metagenomic Phylogeny by Average Sequence Similarity (MPASS). We also compared one of the metagenomic trees with dendrograms of environmental factors using a comparison tool for phylogenetic trees. The MPASS method correctly constructed metagenomic trees of simulated metagenomes and soil and water samples. The topology of the metagenomic tree of samples from the Kirishima hot springs area in Japan was highly similarity to that of the dendrograms based on previously reported environmental factors for this area. The topology of the metagenomic tree also reflected the dynamics of microbiota at the taxonomic and functional levels. Our results strongly suggest that MPASS can successfully classify metagenomic shotgun sequencing data based on the similarity of whole protein-coding sequences, and will be useful for the identification of principal environmental factors for the establishment of microbial communities. Custom Perl script for the MPASS pipeline is available at https://github.com/s0sat/MPASS.
环境因素会影响微生物的生长,从而改变微生物群落的组成。对宏基因组组成与环境梯度之间的关系进行相关分析,有助于阐明关键的环境因素和微生物群落的建立原则。然而,到目前为止,还没有报道一种合理的方法来定量比较整个宏基因组数据,并确定微生物群落建立的主要环境因素。在本研究中,我们开发了一种方法来比较宏基因组鸟枪法测序数据推导的整个蛋白质组,并通过平均序列相似性(MPASS)定量显示它们的系统发育关系。我们还使用一种系统发育树比较工具将其中一个宏基因组树与环境因子的系统发育树进行了比较。MPASS 方法正确构建了模拟宏基因组和土壤及水样的宏基因组树。日本雾岛温泉地区样本的宏基因组树的拓扑结构与基于该地区先前报道的环境因子的系统发育树高度相似。宏基因组树的拓扑结构也反映了在分类和功能水平上微生物群的动态。我们的研究结果强烈表明,MPASS 可以根据整个蛋白质编码序列的相似性成功地对宏基因组鸟枪法测序数据进行分类,并将有助于确定微生物群落建立的主要环境因素。MPASS 管道的自定义 Perl 脚本可在 https://github.com/s0sat/MPASS 上获得。