Division of Nutritional Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
J Nutr. 2023 Jan 14;152(12):2956-2965. doi: 10.1093/jn/nxac195.
The fecal metabolome is affected by diet and includes metabolites generated by human and microbial metabolism. Advances in -omics technologies and analytic approaches have allowed researchers to identify metabolites and better utilize large data sets to generate usable information. One promising aspect of these advancements is the ability to determine objective biomarkers of food intake.
We aimed to utilize a multivariate, machine learning approach to identify metabolite biomarkers that accurately predict food intake.
Data were aggregated from 5 controlled feeding studies in adults that tested the impact of specific foods (almonds, avocados, broccoli, walnuts, barley, and oats) on the gastrointestinal microbiota. Fecal samples underwent GC-MS metabolomic analysis; 344 metabolites were detected in preintervention samples, whereas 307 metabolites were detected postintervention. After removing metabolites that were only detected in either pre- or postintervention and those undetectable in ≥80% of samples in all study groups, changes in 96 metabolites relative concentrations (treatment postintervention minus preintervention) were utilized in random forest models to 1) examine the relation between food consumption and fecal metabolome changes and 2) rank the fecal metabolites by their predictive power (i.e., feature importance score).
Using the change in relative concentration of 96 fecal metabolites, 6 single-food random forest models for almond, avocado, broccoli, walnuts, whole-grain barley, and whole-grain oats revealed prediction accuracies between 47% and 89%. When comparing foods with one another, almond intake was differentiated from walnut intake with 91% classification accuracy.
Our findings reveal promise in utilizing fecal metabolites as objective complements to certain self-reported food intake estimates. Future research on other foods at different doses and dietary patterns is needed to identify biomarkers that can be applied in feeding study compliance and clinical settings.
粪便代谢组受饮食影响,包括人体和微生物代谢产生的代谢物。组学技术和分析方法的进步使研究人员能够识别代谢物,并更好地利用大数据集生成有用信息。这些进展的一个有前途的方面是确定食物摄入的客观生物标志物的能力。
我们旨在利用多元、机器学习方法来识别准确预测食物摄入的代谢物生物标志物。
数据来自 5 项成人受控喂养研究的汇总,这些研究测试了特定食物(杏仁、鳄梨、西兰花、核桃、大麦和燕麦)对胃肠道微生物群的影响。粪便样本进行 GC-MS 代谢组学分析;在干预前样本中检测到 344 种代谢物,而在干预后样本中检测到 307 种代谢物。在去除仅在干预前或干预后检测到的代谢物以及在所有研究组中≥80%的样本中无法检测到的代谢物后,利用随机森林模型分析 96 种相对浓度(干预后减去干预前)变化的代谢物,以 1)检查食物消耗与粪便代谢组变化之间的关系和 2)根据其预测能力(即特征重要性得分)对粪便代谢物进行排名。
使用 96 种粪便代谢物相对浓度的变化,6 种单一食物随机森林模型(杏仁、鳄梨、西兰花、核桃、全谷物大麦和全谷物燕麦)显示预测准确性在 47%至 89%之间。当比较食物彼此之间时,杏仁摄入量与核桃摄入量的分类准确率达到 91%。
我们的研究结果表明,利用粪便代谢物作为某些自我报告食物摄入量估计的客观补充是有希望的。需要对其他不同剂量和饮食模式的食物进行进一步研究,以确定可应用于喂养研究依从性和临床环境的生物标志物。