• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

宏基因组功能谱分析:描绘还是不描绘?

Metagenomic functional profiling: to sketch or not to sketch?

机构信息

School of Electrical Engineering and Computer Science, Pennsylvania State University, University Park, Pennsylvania 16802, United States.

Bioinformatics and Genomics, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802, United States.

出版信息

Bioinformatics. 2024 Sep 1;40(Suppl 2):ii165-ii173. doi: 10.1093/bioinformatics/btae397.

DOI:10.1093/bioinformatics/btae397
PMID:39230701
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11373326/
Abstract

MOTIVATION

Functional profiling of metagenomic samples is essential to decipher the functional capabilities of microbial communities. Traditional and more widely used functional profilers in the context of metagenomics rely on aligning reads against a known reference database. However, aligning sequencing reads against a large and fast-growing database is computationally expensive. In general, k-mer-based sketching techniques have been successfully used in metagenomics to address this bottleneck, notably in taxonomic profiling. In this work, we describe leveraging FracMinHash (implemented in sourmash, a publicly available software), a k-mer-sketching algorithm, to obtain functional profiles of metagenome samples.

RESULTS

We show how pieces of the sourmash software (and the resulting FracMinHash sketches) can be put together in a pipeline to functionally profile a metagenomic sample. We named our pipeline fmh-funprofiler. We report that the functional profiles obtained using this pipeline demonstrate comparable completeness and better purity compared to the profiles obtained using other alignment-based methods when applied to simulated metagenomic data. We also report that fmh-funprofiler is 39-99× faster in wall-clock time, and consumes up to 40-55× less memory. Coupled with the KEGG database, this method not only replicates fundamental biological insights but also highlights novel signals from the Human Microbiome Project datasets.

AVAILABILITY AND IMPLEMENTATION

This fast and lightweight metagenomic functional profiler is freely available and can be accessed here: https://github.com/KoslickiLab/fmh-funprofiler. All scripts of the analyses we present in this manuscript can be found on GitHub.

摘要

动机

对宏基因组样本进行功能分析对于破译微生物群落的功能能力至关重要。传统的和更广泛使用的宏基因组学中的功能分析器依赖于将读取序列与已知参考数据库进行比对。然而,将测序读取序列与大型且快速增长的数据库进行比对在计算上是非常昂贵的。一般来说,基于 k-mer 的草图技术已成功应用于宏基因组学中,以解决这一瓶颈问题,特别是在分类学分析中。在这项工作中,我们描述了利用 FracMinHash(在开源软件 sourmash 中实现),一种 k-mer 草图算法,来获取宏基因组样本的功能图谱。

结果

我们展示了如何将 sourmash 软件的部分内容(以及由此产生的 FracMinHash 草图)组合成一个管道,以对宏基因组样本进行功能分析。我们将这个管道命名为 fmh-funprofiler。我们报告说,当应用于模拟宏基因组数据时,与其他基于比对的方法相比,使用该管道获得的功能图谱具有相当的完整性和更好的纯度。我们还报告说,fmh-funprofiler 在处理时间上快 39-99 倍,消耗的内存少 40-55 倍。与 KEGG 数据库结合使用,该方法不仅复制了基本的生物学见解,还突出了人类微生物组计划数据集的新信号。

可用性和实现

这个快速且轻量级的宏基因组功能分析器是免费提供的,可以在这里访问:https://github.com/KoslickiLab/fmh-funprofiler。我们在本文中呈现的所有分析脚本都可以在 GitHub 上找到。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95af/11373326/fb37b224a1e5/btae397f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95af/11373326/7d88e0c3c7ca/btae397f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95af/11373326/cdc49099ecae/btae397f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95af/11373326/9c7e66de9edb/btae397f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95af/11373326/fb37b224a1e5/btae397f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95af/11373326/7d88e0c3c7ca/btae397f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95af/11373326/cdc49099ecae/btae397f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95af/11373326/9c7e66de9edb/btae397f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95af/11373326/fb37b224a1e5/btae397f4.jpg

相似文献

1
Metagenomic functional profiling: to sketch or not to sketch?宏基因组功能谱分析:描绘还是不描绘?
Bioinformatics. 2024 Sep 1;40(Suppl 2):ii165-ii173. doi: 10.1093/bioinformatics/btae397.
2
KMCP: accurate metagenomic profiling of both prokaryotic and viral populations by pseudo-mapping.KMCP:通过伪映射对原核生物和病毒种群进行准确的宏基因组分析。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac845.
3
AFITbin: a metagenomic contig binning method using aggregate l-mer frequency based on initial and terminal nucleotides.AfiTbin:一种基于初始和末端核苷酸的基于聚合 l-mer 频率的宏基因组序列拼接方法。
BMC Bioinformatics. 2024 Jul 16;25(1):241. doi: 10.1186/s12859-024-05859-7.
4
CMash: fast, multi-resolution estimation of k-mer-based Jaccard and containment indices.CMash:基于 k-mer 的 Jaccard 和包含指数的快速、多分辨率估计。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i28-i35. doi: 10.1093/bioinformatics/btac237.
5
Struo: a pipeline for building custom databases for common metagenome profilers.Struo:用于为常见宏基因组分析器构建自定义数据库的管道。
Bioinformatics. 2020 Apr 1;36(7):2314-2315. doi: 10.1093/bioinformatics/btz899.
6
Cosine Similarity Estimation Using FracMinHash: Theoretical Analysis, Safety Conditions, and Implementation.使用FracMinHash的余弦相似度估计:理论分析、安全条件及实现
bioRxiv. 2024 May 30:2024.05.24.595805. doi: 10.1101/2024.05.24.595805.
7
CONSULT-II: accurate taxonomic identification and profiling using locality-sensitive hashing.CONSULT-II:基于位置敏感哈希的准确分类鉴定和特征分析。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae150.
8
CAMISIM: simulating metagenomes and microbial communities.CAMISIM:模拟宏基因组和微生物群落。
Microbiome. 2019 Feb 8;7(1):17. doi: 10.1186/s40168-019-0633-6.
9
Cultivation-independent genomes greatly expand taxonomic-profiling capabilities of mOTUs across various environments.非培养基因组极大地扩展了 mOTU 在各种环境中的分类鉴定能力。
Microbiome. 2022 Dec 5;10(1):212. doi: 10.1186/s40168-022-01410-z.
10
Re-purposing software for functional characterization of the microbiome.重新利用软件对微生物组进行功能特征分析。
Microbiome. 2021 Jan 9;9(1):4. doi: 10.1186/s40168-020-00971-1.

本文引用的文献

1
Washed Microbiota Transplantation Improves Patients with Overweight by the Gut Microbiota and Sphingolipid Metabolism.经肠道微生物群和鞘脂代谢的洗肠微生物群移植可改善超重患者的状况。
Biomedicines. 2023 Aug 29;11(9):2415. doi: 10.3390/biomedicines11092415.
2
Gut microbial carbohydrate metabolism contributes to insulin resistance.肠道微生物的碳水化合物代谢有助于胰岛素抵抗。
Nature. 2023 Sep;621(7978):389-395. doi: 10.1038/s41586-023-06466-x. Epub 2023 Aug 30.
3
Deriving confidence intervals for mutation rates across a wide range of evolutionary distances using FracMinHash.
使用 FracMinHash 在广泛的进化距离范围内推导突变率的置信区间。
Genome Res. 2023 Jul;33(7):1061-1068. doi: 10.1101/gr.277651.123. Epub 2023 Jun 21.
4
Mucin glycans and their degradation by gut microbiota.粘蛋白糖及其被肠道微生物群降解。
Glycoconj J. 2023 Aug;40(4):493-512. doi: 10.1007/s10719-023-10124-9. Epub 2023 Jun 15.
5
The roles of dietary lipids and lipidomics in gut-brain axis in type 2 diabetes mellitus.膳食脂质和脂质组学在 2 型糖尿病的肠脑轴中的作用。
J Transl Med. 2023 Apr 2;21(1):240. doi: 10.1186/s12967-023-04088-5.
6
Metabolic modelling of the human gut microbiome in type 2 diabetes patients in response to metformin treatment.2 型糖尿病患者二甲双胍治疗反应的肠道微生物组代谢建模。
NPJ Syst Biol Appl. 2023 Jan 21;9(1):2. doi: 10.1038/s41540-022-00261-6.
7
Human Gut Microbiota and Drug Metabolism.人体肠道微生物群与药物代谢。
Microb Ecol. 2023 Jul;86(1):97-111. doi: 10.1007/s00248-022-02081-x. Epub 2022 Jul 23.
8
CMash: fast, multi-resolution estimation of k-mer-based Jaccard and containment indices.CMash:基于 k-mer 的 Jaccard 和包含指数的快速、多分辨率估计。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i28-i35. doi: 10.1093/bioinformatics/btac237.
9
Critical Assessment of Metagenome Interpretation: the second round of challenges.宏基因组解读的关键评估:第二轮挑战。
Nat Methods. 2022 Apr;19(4):429-440. doi: 10.1038/s41592-022-01431-4. Epub 2022 Apr 8.
10
Metagenomic Analysis Reveals a Changing Microbiome Associated With the Depth of Invasion of Oral Squamous Cell Carcinoma.宏基因组分析揭示了与口腔鳞状细胞癌侵袭深度相关的不断变化的微生物群。
Front Microbiol. 2022 Feb 9;13:795777. doi: 10.3389/fmicb.2022.795777. eCollection 2022.