MICROPHERRET: MICRObial PHEnotypic tRait ClassifieR using Machine lEarning Techniques.

Authors

Bizzotto Edoardo, Fraulini Sofia, Zampieri Guido, Orellana Esteban, Treu Laura, Campanaro Stefano

Affiliation

Department of Biology, University of Padova, Padova, 35131, Italy.

Publication

Environ Microbiome. 2024 Aug 8;19(1):58. doi: 10.1186/s40793-024-00600-6.

DOI: 10.1186/s40793-024-00600-6
PMID: 39113074
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11308548/
Abstract

BACKGROUND

In recent years, there has been a rapid increase in the number of microbial genomes reconstructed through shotgun sequencing, and obtained by newly developed approaches including metagenomic binning and single-cell sequencing. However, our ability to functionally characterize these genomes by experimental assays is orders of magnitude less efficient. Consequently, there is a pressing need for the development of swift and automated strategies for the functional classification of microbial genomes.

RESULTS

The present work leverages a suite of supervised machine learning algorithms to establish a range of 86 metabolic and other ecological functions, such as methanotrophy and plastic degradation, starting from widely obtainable microbial genome annotations. Tests performed on independent datasets demonstrated robust performance across complete, fragmented, and incomplete genomes above a 70% completeness level for most of the considered functions. Application of the algorithms to the Biogas Microbiome database yielded predictions broadly consistent with current biological knowledge and correctly detecting functionally-related nuances of archaeal genomes. Finally, a case study focused on acetoclastic methanogenesis demonstrated how the developed machine learning models can be refined or expanded with models describing novel functions of interest.
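
The "one supervised model per function" setup described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not name the algorithms used, so a small Bernoulli naive Bayes classifier stands in for them, and the annotation IDs (`ann_A` to `ann_D`), the toy labels, and the function names are all hypothetical.

```python
import math


class BernoulliNB:
    """Tiny Bernoulli naive Bayes over presence/absence of annotation IDs.

    Stand-in classifier chosen for brevity; the source does not specify
    which algorithms MICROPHERRET uses.
    """

    def fit(self, genomes, labels):
        # genomes: list of sets of annotation IDs; labels: list of bool
        self.vocab = sorted(set().union(*genomes))
        self.log_prior = {}
        self.p_present = {}
        for cls in (True, False):
            members = [g for g, y in zip(genomes, labels) if y == cls]
            # Laplace smoothing keeps every probability strictly in (0, 1).
            self.log_prior[cls] = math.log((len(members) + 1) / (len(genomes) + 2))
            self.p_present[cls] = {
                f: (sum(f in g for g in members) + 1) / (len(members) + 2)
                for f in self.vocab
            }
        return self

    def predict(self, genome):
        scores = {}
        for cls in (True, False):
            score = self.log_prior[cls]
            for f in self.vocab:
                p = self.p_present[cls][f]
                score += math.log(p if f in genome else 1 - p)
            scores[cls] = score
        return scores[True] > scores[False]


def train_per_function(genomes, trait_table):
    """Fit one independent binary model per functional class."""
    return {
        trait: BernoulliNB().fit(genomes, labels)
        for trait, labels in trait_table.items()
    }


# Toy training set: each genome is the set of annotations found in it.
genomes = [
    {"ann_A", "ann_B"},
    {"ann_A", "ann_C"},
    {"ann_D", "ann_B"},
    {"ann_D", "ann_C"},
]
# Toy labels (hypothetical): one boolean column per function.
trait_table = {
    "methanotrophy": [True, True, False, False],
    "plastic_degradation": [False, False, True, True],
}

models = train_per_function(genomes, trait_table)
new_genome = {"ann_A", "ann_B", "ann_C"}
predictions = {trait: m.predict(new_genome) for trait, m in models.items()}
print(predictions)
```

Training the functions independently means a new function of interest can be added by fitting one more model, without retraining the others.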

CONCLUSIONS

The resulting tool, MICROPHERRET, incorporates a total of 86 models, one for each tested functional class, and can be applied to high-quality microbial genomes as well as to low-quality genomes derived from metagenomics and single-cell sequencing. MICROPHERRET can thus aid in understanding the functional role of newly generated genomes within their micro-ecological context.
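
One way to probe how a classifier behaves on incomplete genomes is to degrade a complete annotation set and check whether the prediction changes. The sketch below assumes incompleteness can be approximated by uniform random loss of annotations, which is a simplification and not necessarily the procedure used in the paper. The function names, the marker-based toy predictor, and the annotation IDs are hypothetical.

```python
import random


def simulate_incompleteness(annotations, completeness, seed=0):
    """Keep a random fraction of a genome's annotations."""
    if not 0.0 < completeness <= 1.0:
        raise ValueError("completeness must be in (0, 1]")
    rng = random.Random(seed)
    ordered = sorted(annotations)  # fixed order makes the sample reproducible
    keep = round(len(ordered) * completeness)
    return set(rng.sample(ordered, keep))


def stability(predict, annotations, completeness, trials=100):
    """Fraction of degraded copies whose prediction matches the full genome."""
    reference = predict(annotations)
    agree = sum(
        predict(simulate_incompleteness(annotations, completeness, seed=t)) == reference
        for t in range(trials)
    )
    return agree / trials


# Hypothetical genome with 20 annotations and a toy rule-based predictor
# that calls the trait when at least 2 of 3 marker annotations are present.
genome = {f"ann_{i:02d}" for i in range(20)}
markers = {"ann_00", "ann_01", "ann_02"}


def toy_predict(annotations):
    return len(markers & annotations) >= 2


degraded = simulate_incompleteness(genome, completeness=0.7)
print(len(degraded), "of", len(genome), "annotations kept")
print("stability at 70% completeness:", stability(toy_predict, genome, 0.7))
```

Any trained model with a `predict`-style callable could be passed in place of `toy_predict`.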

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ca/11308548/b4e2d71f8d41/40793_2024_600_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ca/11308548/1c76f38daf31/40793_2024_600_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ca/11308548/8e7734f593e6/40793_2024_600_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ca/11308548/d6511853ae37/40793_2024_600_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65ca/11308548/ad0c8caa2e92/40793_2024_600_Fig5_HTML.jpg

Similar articles

1
MICROPHERRET: MICRObial PHEnotypic tRait ClassifieR using Machine lEarning Techniques.
Environ Microbiome. 2024 Aug 8;19(1):58. doi: 10.1186/s40793-024-00600-6.
2
Evaluating metagenomics tools for genome binning with real metagenomic datasets and CAMI datasets.
BMC Bioinformatics. 2020 Jul 28;21(1):334. doi: 10.1186/s12859-020-03667-3.
3
Evaluating Assembly and Binning Strategies for Time Series Drinking Water Metagenomes.
Microbiol Spectr. 2021 Dec 22;9(3):e0143421. doi: 10.1128/Spectrum.01434-21. Epub 2021 Nov 3.
4
Optimizing and evaluating the reconstruction of Metagenome-assembled microbial genomes.
BMC Genomics. 2017 Nov 28;18(1):915. doi: 10.1186/s12864-017-4294-1.
5
Recovery of strain-resolved genomes from human microbiome through an integration framework of single-cell genomics and metagenomics.
Microbiome. 2021 Oct 12;9(1):202. doi: 10.1186/s40168-021-01152-4.
6
Metagenomic analysis and functional characterization of the biogas microbiome using high throughput shotgun sequencing and a novel binning strategy.
Biotechnol Biofuels. 2016 Feb 2;9:26. doi: 10.1186/s13068-016-0441-1. eCollection 2016.
7
Human reference gut microbiome catalog including newly assembled genomes from under-represented Asian metagenomes.
Genome Med. 2021 Aug 27;13(1):134. doi: 10.1186/s13073-021-00950-7.
8
MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets.
Bioinformatics. 2016 Feb 15;32(4):605-7. doi: 10.1093/bioinformatics/btv638. Epub 2015 Oct 29.
9
Deeper insight into the structure of the anaerobic digestion microbial community; the biogas microbiome database is expanded with 157 new genomes.
Bioresour Technol. 2016 Sep;216:260-6. doi: 10.1016/j.biortech.2016.05.081. Epub 2016 May 24.
10
Gene-based microbiome representation enhances host phenotype classification.
mSystems. 2023 Aug 31;8(4):e0053123. doi: 10.1128/msystems.00531-23. Epub 2023 Jul 5.
