School of Biomedical Sciences, Faculty of Medicine and Health, UNSW Sydney, Sydney, New South Wales, Australia.
School of Clinical Medicine, Faculty of Medicine and Health, UNSW Sydney, Sydney, New South Wales, Australia.
mSystems. 2024 Sep 17;9(9):e0074624. doi: 10.1128/msystems.00746-24. Epub 2024 Aug 13.
Characterization of microbial community metabolic output is crucial to understanding their functions. Construction of genome-scale metabolic models from metagenome-assembled genomes (MAG) has enabled prediction of metabolite production by microbial communities, yet little is known about their accuracy. Here, we examined the performance of two approaches for metabolite prediction from metagenomes, one that is MAG-guided and another that is taxonomic reference-guided. We applied both on shotgun metagenomics data from human and environmental samples, and validated findings in the human samples using untargeted metabolomics. We found that in human samples, where taxonomic profiling is optimized and reference genomes are readily available, when number of input taxa was normalized, the reference-guided approach predicted more metabolites than the MAG-guided approach. The two approaches showed significant overlap but each identified metabolites not predicted in the other. Pathway enrichment analyses identified significant differences in inferences derived from data based on the approach, highlighting the need for caution in interpretation. In environmental samples, when the number of input taxa was normalized, the reference-guided approach predicted more metabolites than the MAG-guided approach for total metabolites in both sample types and non-redundant metabolites in seawater samples. Nonetheless, as was observed for the human samples, the approaches overlapped substantially but also predicted metabolites not observed in the other. Our findings report on utility of a complementary input to genome-scale metabolic model construction that is less computationally intensive forgoing MAG assembly and refinement, and that can be applied on shallow shotgun sequencing where MAGs cannot be generated.IMPORTANCELittle is known about the accuracy of genome-scale metabolic models (GEMs) of microbial communities despite their influence on inferring community metabolic outputs and culture conditions. The performance of GEMs for metabolite prediction from metagenomes was assessed by applying two approaches on shotgun metagenomics data from human and environmental samples, and validating findings in the human samples using untargeted metabolomics. The performance of the approach was found to be dependent on sample type, but collectively, the reference-guided approach predicted more metabolites than the MAG-guided approach. Despite the differences, the predictions from the approaches overlapped substantially but each identified metabolites not predicted in the other. We found significant differences in biological inferences based on the approach, with some examples of uniquely enriched pathways in one group being invalidated when using the alternative approach, highlighting the need for caution in interpretation of GEMs.
对微生物群落代谢产物进行特征分析对于理解其功能至关重要。基于宏基因组组装基因组(MAG)构建基因组规模的代谢模型,使得能够预测微生物群落产生的代谢产物,但对于其准确性知之甚少。在这里,我们检查了两种从宏基因组预测代谢物的方法的性能,一种是 MAG 指导的方法,另一种是分类参考指导的方法。我们将这两种方法应用于来自人类和环境样本的 shotgun 宏基因组学数据,并使用非靶向代谢组学在人类样本中验证了研究结果。我们发现,在人类样本中,当优化了分类分析并且参考基因组易于获得时,当输入分类单元的数量标准化时,参考指导的方法比 MAG 指导的方法预测到更多的代谢物。这两种方法有显著的重叠,但都发现了另一种方法没有预测到的代谢物。途径富集分析确定了基于方法的数据推断之间的显著差异,突出了在解释方面需要谨慎。在环境样本中,当输入分类单元的数量标准化时,参考指导的方法在两种样本类型的总代谢物和海水样本的非冗余代谢物中预测到的代谢物多于 MAG 指导的方法。尽管如此,与人类样本一样,这两种方法有很大的重叠,但也预测到了另一种方法没有观察到的代谢物。我们的研究结果报告了一种补充基因组规模代谢模型构建的方法的实用性,该方法不进行 MAG 组装和细化,计算强度较低,并且可以应用于无法生成 MAG 的浅层 shotgun 测序中。
重要性尽管基因组规模代谢模型(GEM)对推断群落代谢产物和培养条件有影响,但对于微生物群落的代谢物预测,GEM 的准确性知之甚少。通过将两种方法应用于来自人类和环境样本的 shotgun 宏基因组学数据,并使用非靶向代谢组学在人类样本中验证研究结果,评估了 GEM 用于从宏基因组预测代谢物的性能。发现该方法的性能取决于样本类型,但总体而言,参考指导的方法预测到的代谢物多于 MAG 指导的方法。尽管存在差异,但两种方法的预测结果有很大的重叠,但都发现了另一种方法没有预测到的代谢物。我们发现,基于方法的生物学推断存在显著差异,当使用替代方法时,一些组中独特富集途径的例子被否定,突出了在解释 GEM 时需要谨慎。