Constructing phylogenetic trees for microbiome data analysis: A mini-review.

Authors

Liu Ruitao, Qiao Xi, Shi Yushu, Peterson Christine B, Bush William S, Cominelli Fabio, Wang Ming, Zhang Liangliang

Affiliations

Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, 44106, OH, United States.

Weill Cornell Medicine, Cornell University, 1300 York Ave, New York, 10065, NY, United States.

Publication information

Comput Struct Biotechnol J. 2024 Oct 24;23:3859-3868. doi: 10.1016/j.csbj.2024.10.032. eCollection 2024 Dec.

DOI:10.1016/j.csbj.2024.10.032
PMID:39554614
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11564040/
Abstract

As next-generation sequencing technologies advance rapidly and the cost of metagenomic sequencing continues to decrease, researchers now face an unprecedented volume of microbiome data. This surge has stimulated the development of scalable microbiome data analysis methods and necessitated the incorporation of phylogenetic information into microbiome analysis for improved accuracy. Tools for constructing phylogenetic trees from 16S rRNA sequencing data are well established, as the highly conserved regions of the 16S gene are limited, simplifying the identification of marker genes. In contrast, metagenomic and whole-genome shotgun (WGS) sequencing involve sequencing random fragments of the entire genome, making identification of consistent marker genes challenging owing to the vast diversity of genomic regions, resulting in a scarcity of robust tools for constructing phylogenetic trees. Although bacterial sequence tree construction tools exist for upstream bioinformatics, many downstream researchers (those integrating these trees into statistical models or machine learning) are either unaware of these tools or find them difficult to use due to the steep learning curve of processing raw sequences. This is compounded by the fact that public datasets often lack phylogenetic trees, providing only abundance tables and taxonomic classifications. To address this, we present a comprehensive review of phylogenetic tree construction techniques for microbiome data (16S rRNA or whole-genome shotgun sequencing). We outline the strengths and limitations of current methods, offering expert insights and step-by-step guidance to make these tools more accessible and widely applicable in quantitative microbiome data analysis.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbc0/11564040/3f65230c590a/gr003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbc0/11564040/44141e5cafb8/gr001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbc0/11564040/9008f40689c7/gr002.jpg
