• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习预测微生物群落功能:对凋落叶分解过程中溶解有机碳的分析。

Machine learning to predict microbial community functions: An analysis of dissolved organic carbon from litter decomposition.

机构信息

Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States of America.

Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States of America.

出版信息

PLoS One. 2019 Jul 1;14(7):e0215502. doi: 10.1371/journal.pone.0215502. eCollection 2019.

DOI:10.1371/journal.pone.0215502
PMID:31260460
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6602172/
Abstract

Microbial communities are ubiquitous and often influence macroscopic properties of the ecosystems they inhabit. However, deciphering the functional relationship between specific microbes and ecosystem properties is an ongoing challenge owing to the complexity of the communities. This challenge can be addressed, in part, by integrating the advances in DNA sequencing technology with computational approaches like machine learning. Although machine learning techniques have been applied to microbiome data, use of these techniques remains rare, and user-friendly platforms to implement such techniques are not widely available. We developed a tool that implements neural network and random forest models to perform regression and feature selection tasks on microbiome data. In this study, we applied the tool to analyze soil microbiome (16S rRNA gene profiles) and dissolved organic carbon (DOC) data from a 44-day plant litter decomposition experiment. The microbiome data includes 1709 total bacterial operational taxonomic units (OTU) from 300+ microcosms. Regression analysis of predicted and actual DOC for a held-out test set of 51 samples yield Pearson's correlation coefficients of.636 and.676 for neural network and random forest approaches, respectively. Important taxa identified by the machine learning techniques are compared to results from a standard tool (indicator species analysis) widely used by microbial ecologists. Of 1709 bacterial taxa, indicator species analysis identified 285 taxa as significant determinants of DOC concentration. Of the top 285 ranked features determined by machine learning methods, a subset of 86 taxa are common to all feature selection techniques. Using this subset of features, prediction results for random permutations of the data set are at least equally accurate compared to predictions determined using the entire feature set. Our results suggest that integration of multiple methods can aid identification of a robust subset of taxa within complex communities that may drive specific functional outcomes of interest.

摘要

微生物群落无处不在,通常会影响它们所栖息的生态系统的宏观特性。然而,由于群落的复杂性,破译特定微生物与生态系统特性之间的功能关系仍然是一个持续的挑战。这个挑战可以部分通过将 DNA 测序技术的进步与机器学习等计算方法相结合来解决。尽管机器学习技术已应用于微生物组数据,但这些技术的使用仍然很少,并且没有广泛可用的用户友好型平台来实施这些技术。我们开发了一种工具,该工具实现了神经网络和随机森林模型,可对微生物组数据执行回归和特征选择任务。在这项研究中,我们应用该工具来分析来自 44 天植物凋落叶分解实验的土壤微生物组(16S rRNA 基因谱)和溶解有机碳(DOC)数据。微生物组数据包括来自 300 多个微宇宙的 1709 个总细菌操作分类单元(OTU)。对 51 个样本的保留测试集的预测和实际 DOC 的回归分析,神经网络和随机森林方法的 Pearson 相关系数分别为 0.636 和 0.676。机器学习技术确定的重要分类群与微生物生态学家广泛使用的标准工具(指示物种分析)的结果进行了比较。在 1709 个细菌分类群中,指示物种分析确定了 285 个分类群是 DOC 浓度的重要决定因素。在机器学习方法确定的前 285 个排名特征中,有 86 个分类群是所有特征选择技术共有的。使用该特征子集,与使用整个特征集确定的预测结果相比,对数据集的随机排列进行的预测结果至少同样准确。我们的结果表明,整合多种方法可以帮助识别复杂群落中可能驱动特定感兴趣功能结果的稳健分类群子集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/035a41a55601/pone.0215502.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/3969809d88c1/pone.0215502.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/e6aa40d51adb/pone.0215502.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/9ca275f49a03/pone.0215502.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/0d486c3af098/pone.0215502.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/a8da496613b1/pone.0215502.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/035a41a55601/pone.0215502.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/3969809d88c1/pone.0215502.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/e6aa40d51adb/pone.0215502.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/9ca275f49a03/pone.0215502.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/0d486c3af098/pone.0215502.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/a8da496613b1/pone.0215502.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d82f/6602172/035a41a55601/pone.0215502.g006.jpg

相似文献

1
Machine learning to predict microbial community functions: An analysis of dissolved organic carbon from litter decomposition.机器学习预测微生物群落功能:对凋落叶分解过程中溶解有机碳的分析。
PLoS One. 2019 Jul 1;14(7):e0215502. doi: 10.1371/journal.pone.0215502. eCollection 2019.
2
Robust prediction of colorectal cancer via gut microbiome 16S rRNA sequencing data.通过肠道微生物组 16S rRNA 测序数据进行稳健的结直肠癌预测。
J Med Microbiol. 2024 Oct;73(10). doi: 10.1099/jmm.0.001903.
3
Microbial Communities Influence Soil Dissolved Organic Carbon Concentration by Altering Metabolite Composition.微生物群落通过改变代谢物组成影响土壤溶解有机碳浓度。
Front Microbiol. 2022 Jan 20;12:799014. doi: 10.3389/fmicb.2021.799014. eCollection 2021.
4
Drivers of stability and transience in composition-functioning links during serial propagation of litter-decomposing microbial communities.在连续传播分解 litter 的微生物群落的过程中,组成-功能联系的稳定性和瞬态性的驱动因素。
mSystems. 2023 Jun 29;8(3):e0122022. doi: 10.1128/msystems.01220-22. Epub 2023 May 3.
5
Soil Bacterial and Fungal Richness Forecast Patterns of Early Pine Litter Decomposition.土壤细菌和真菌丰富度预测早期松树凋落物分解模式。
Front Microbiol. 2020 Nov 6;11:542220. doi: 10.3389/fmicb.2020.542220. eCollection 2020.
6
Machine Learning Predicts Biogeochemistry from Microbial Community Structure in a Complex Model System.机器学习从复杂模型系统中的微生物群落结构预测生物地球化学。
Microbiol Spectr. 2022 Feb 23;10(1):e0190921. doi: 10.1128/spectrum.01909-21. Epub 2022 Feb 9.
7
A comparative study of the gut microbiota in immune-mediated inflammatory diseases-does a common dysbiosis exist?免疫介导的炎症性疾病的肠道微生物组比较研究——是否存在共同的菌群失调?
Microbiome. 2018 Dec 13;6(1):221. doi: 10.1186/s40168-018-0603-4.
8
Differences in substrate use linked to divergent carbon flow during litter decomposition.在凋落物分解过程中,与不同碳流相关的基质利用差异。
FEMS Microbiol Ecol. 2020 Aug 1;96(8). doi: 10.1093/femsec/fiaa135.
9
Microbial community functioning during plant litter decomposition.微生物群落在植物凋落物分解过程中的功能。
Sci Rep. 2022 May 6;12(1):7451. doi: 10.1038/s41598-022-11485-1.
10
Local Network Properties of Soil and Rhizosphere Microbial Communities in Potato Plantations Treated with a Biological Product Are Important Predictors of Crop Yield.生物制品处理下的马铃薯种植地土壤及根际微生物群落的局域网络特性是作物产量的重要预测因子。
mSphere. 2021 Aug 25;6(4):e0013021. doi: 10.1128/mSphere.00130-21. Epub 2021 Aug 11.

引用本文的文献

1
A benchmark analysis of feature selection and machine learning methods for environmental metabarcoding datasets.环境宏条形码数据集的特征选择和机器学习方法的基准分析
Comput Struct Biotechnol J. 2025 Apr 16;27:1636-1647. doi: 10.1016/j.csbj.2025.04.017. eCollection 2025.
2
Operationalizing Team Science at the Academic Cancer Center Network to Unveil the Structure and Function of the Gut Microbiome.在学术癌症中心网络中实施团队科学以揭示肠道微生物组的结构与功能。
J Clin Med. 2025 Mar 17;14(6):2040. doi: 10.3390/jcm14062040.
3
The invisible architects: microbial communities and their transformative role in soil health and global climate changes.

本文引用的文献

1
q2-sample-classifier: machine-learning tools for microbiome classification and regression.q2样本分类器:用于微生物组分类和回归的机器学习工具。
J Open Res Softw. 2018;3(30). doi: 10.21105/joss.00934. Epub 2018 Oct 23.
2
The importance of anabolism in microbial control over soil carbon storage.合成代谢在微生物对土壤碳储存的控制中的重要性。
Nat Microbiol. 2017 Jul 25;2:17105. doi: 10.1038/nmicrobiol.2017.105.
3
Metagenome-Wide Association Study and Machine Learning Prediction of Bulk Soil Microbiome and Crop Productivity.土壤宏基因组关联研究与机器学习对土壤微生物群落和作物生产力的预测
无形的建筑师:微生物群落及其在土壤健康和全球气候变化中的变革性作用。
Environ Microbiome. 2025 Mar 25;20(1):36. doi: 10.1186/s40793-025-00694-6.
4
Beyond correlation: Understanding the causal link between microbiome and plant health.超越相关性:理解微生物组与植物健康之间的因果联系。
Heliyon. 2024 Nov 19;10(23):e40517. doi: 10.1016/j.heliyon.2024.e40517. eCollection 2024 Dec 15.
5
Microbial community composition predicts bacterial production across ocean ecosystems.微生物群落组成可预测海洋生态系统中的细菌生产力。
ISME J. 2024 Jan 8;18(1). doi: 10.1093/ismejo/wrae158.
6
Gradient boosted regression as a tool to reveal key drivers of temporal dynamics in a synthetic yeast community.梯度提升回归作为一种工具,揭示了人工酵母群落时间动态的关键驱动因素。
FEMS Microbiol Ecol. 2024 Jun 17;100(7). doi: 10.1093/femsec/fiae080.
7
Proportion-based normalizations outperform compositional data transformations in machine learning applications.基于比例的归一化在机器学习应用中优于成分数据变换。
Microbiome. 2024 Mar 5;12(1):45. doi: 10.1186/s40168-023-01747-z.
8
Soil fertility impact on recruitment and diversity of the soil microbiome in sub-humid tropical pastures in Northeastern Brazil.土壤肥力对巴西东北部亚热带牧场土壤微生物组的定殖和多样性的影响。
Sci Rep. 2024 Feb 16;14(1):3919. doi: 10.1038/s41598-024-54221-7.
9
Nine (not so simple) steps: a practical guide to using machine learning in microbial ecology.九步(并非那么简单):微生物生态学中使用机器学习的实用指南。
mBio. 2024 Feb 14;15(2):e0205023. doi: 10.1128/mbio.02050-23. Epub 2023 Dec 21.
10
Association between microbiome and the development of adverse posttraumatic neuropsychiatric sequelae after traumatic stress exposure.创伤后应激暴露后微生物组与不良创伤后神经精神后遗症发展的关系。
Transl Psychiatry. 2023 Nov 18;13(1):354. doi: 10.1038/s41398-023-02643-8.
Front Microbiol. 2017 Apr 3;8:519. doi: 10.3389/fmicb.2017.00519. eCollection 2017.
4
A Machine Learning Approach for Using the Postmortem Skin Microbiome to Estimate the Postmortem Interval.一种利用死后皮肤微生物群估计死后间隔时间的机器学习方法。
PLoS One. 2016 Dec 22;11(12):e0167370. doi: 10.1371/journal.pone.0167370. eCollection 2016.
5
Machine Learning Meta-analysis of Large Metagenomic Datasets: Tools and Biological Insights.大型宏基因组数据集的机器学习荟萃分析:工具与生物学见解
PLoS Comput Biol. 2016 Jul 11;12(7):e1004977. doi: 10.1371/journal.pcbi.1004977. eCollection 2016 Jul.
6
Deep Feature Selection: Theory and Application to Identify Enhancers and Promoters.深度特征选择:识别增强子和启动子的理论与应用
J Comput Biol. 2016 May;23(5):322-36. doi: 10.1089/cmb.2015.0189. Epub 2016 Jan 22.
7
A comprehensive evaluation of multicategory classification methods for microbiomic data.宏基因组数据多分类方法的综合评价。
Microbiome. 2013 Apr 5;1(1):11. doi: 10.1186/2049-2618-1-11.
8
The Genboree Microbiome Toolset and the analysis of 16S rRNA microbial sequences.Genboree 微生物组工具集和 16S rRNA 微生物序列分析。
BMC Bioinformatics. 2012;13 Suppl 13(Suppl 13):S11. doi: 10.1186/1471-2105-13-S13-S11. Epub 2012 Aug 24.
9
Predicting bacterial community assemblages using an artificial neural network approach.使用人工神经网络方法预测细菌群落组合。
Nat Methods. 2012 Apr 15;9(6):621-5. doi: 10.1038/nmeth.1975.
10
Sparse distance-based learning for simultaneous multiclass classification and feature selection of metagenomic data.基于稀疏距离的宏基因组数据同时多类分类和特征选择学习。
Bioinformatics. 2011 Dec 1;27(23):3242-9. doi: 10.1093/bioinformatics/btr547. Epub 2011 Oct 7.