Department CIBIO, University of Trento, Trento, Italy.
Harvard T.H. Chan School of Public Health, Boston, MA, USA.
Nat Biotechnol. 2023 Nov;41(11):1633-1644. doi: 10.1038/s41587-023-01688-w. Epub 2023 Feb 23.
Metagenomic assembly enables new organism discovery from microbial communities, but it can only capture few abundant organisms from most metagenomes. Here we present MetaPhlAn 4, which integrates information from metagenome assemblies and microbial isolate genomes for more comprehensive metagenomic taxonomic profiling. From a curated collection of 1.01 M prokaryotic reference and metagenome-assembled genomes, we define unique marker genes for 26,970 species-level genome bins, 4,992 of them taxonomically unidentified at the species level. MetaPhlAn 4 explains ~20% more reads in most international human gut microbiomes and >40% in less-characterized environments such as the rumen microbiome and proves more accurate than available alternatives on synthetic evaluations while also reliably quantifying organisms with no cultured isolates. Application of the method to >24,500 metagenomes highlights previously undetected species to be strong biomarkers for host conditions and lifestyles in human and mouse microbiomes and shows that even previously uncharacterized species can be genetically profiled at the resolution of single microbial strains.
宏基因组组装可从微生物群落中发现新的生物体,但它只能从大多数宏基因组中捕获少数丰富的生物体。在这里,我们展示了 MetaPhlAn 4,它整合了宏基因组组装和微生物分离物基因组的信息,以进行更全面的宏基因组分类分析。从经过精心整理的 1010 万种原核参考和宏基因组组装基因组中,我们为 26970 个种水平的基因组分类群定义了独特的标记基因,其中 4992 个在种水平上未被分类。MetaPhlAn 4 可以解释大多数国际人类肠道微生物组中 20%以上的reads,在较少研究的环境(如瘤胃微生物组)中则可以解释超过 40%的reads,在综合评估中比现有的替代方法更准确,同时还可以可靠地定量没有培养分离物的生物体。该方法在超过 24500 个宏基因组中的应用突出了以前未检测到的物种,这些物种是人类和小鼠微生物组中宿主状况和生活方式的强有力生物标志物,并表明即使是以前未被描述的物种也可以在单个微生物菌株的分辨率下进行遗传分析。