• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SCRAPP:一种用于评估基于系统发育定位的微生物样本多样性的工具。

SCRAPP: A tool to assess the diversity of microbial samples from phylogenetic placements.

机构信息

Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.

Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany.

出版信息

Mol Ecol Resour. 2021 Jan;21(1):340-349. doi: 10.1111/1755-0998.13255. Epub 2020 Oct 9.

DOI:10.1111/1755-0998.13255
PMID:32996237
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7756409/
Abstract

Microbial ecology research is currently driven by the continuously decreasing cost of DNA sequencing and the improving accuracy of data analysis methods. One such analysis method is phylogenetic placement, which establishes the phylogenetic identity of the anonymous environmental sequences in a sample by means of a given phylogenetic reference tree. However, assessing the diversity of a sample remains challenging, as traditional methods do not scale well with the increasing data volumes and/or do not leverage the phylogenetic placement information. Here, we present scrapp, a highly parallel and scalable tool that uses a molecular species delimitation algorithm to quantify the diversity distribution over the reference phylogeny for a given phylogenetic placement of the sample. scrapp employs a novel approach to cluster phylogenetic placements, called placement space clustering, to efficiently perform dimensionality reduction, so as to scale on large data volumes. Furthermore, it uses the phylogeny-aware molecular species delimitation method mPTP to quantify diversity. We evaluated scrapp using both, simulated and empirical data sets. We use simulated data to verify our approach. Tests on an empirical data set show that scrapp-derived metrics can classify samples by their diversity-correlated features equally well or better than existing, commonly used approaches. scrapp is available at https://github.com/pbdas/scrapp.

摘要

微生物生态学研究目前受到 DNA 测序成本不断降低和数据分析方法准确性不断提高的推动。其中一种分析方法是系统发育定位,它通过给定的系统发育参考树来确定样本中匿名环境序列的系统发育身份。然而,评估样本的多样性仍然具有挑战性,因为传统方法无法很好地扩展到不断增加的数据量,或者无法利用系统发育定位信息。在这里,我们介绍了 scrapp,这是一种高度并行和可扩展的工具,它使用分子物种划分算法来量化给定样本系统发育定位的参考系统发育上的多样性分布。scrapp 采用了一种称为放置空间聚类的新方法来对聚类进行聚类,以有效地进行降维,从而适应大数据量。此外,它还使用了基于亲缘关系的分子物种划分方法 mPTP 来量化多样性。我们使用模拟数据集和经验数据集来评估 scrapp。我们使用模拟数据来验证我们的方法。对经验数据集的测试表明,scrapp 衍生的指标可以根据其与多样性相关的特征同等或更好地对样本进行分类,而不是现有的常用方法。scrapp 可在 https://github.com/pbdas/scrapp 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db6c/7756409/050c7692bf9f/MEN-21-340-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db6c/7756409/d28a281d13ea/MEN-21-340-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db6c/7756409/a28f58da209d/MEN-21-340-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db6c/7756409/050c7692bf9f/MEN-21-340-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db6c/7756409/d28a281d13ea/MEN-21-340-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db6c/7756409/a28f58da209d/MEN-21-340-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db6c/7756409/050c7692bf9f/MEN-21-340-g003.jpg

相似文献

1
SCRAPP: A tool to assess the diversity of microbial samples from phylogenetic placements.SCRAPP:一种用于评估基于系统发育定位的微生物样本多样性的工具。
Mol Ecol Resour. 2021 Jan;21(1):340-349. doi: 10.1111/1755-0998.13255. Epub 2020 Oct 9.
2
EPIK: precise and scalable evolutionary placement with informative k-mers.EPIK:带信息 k- -mer 的精确可扩展进化放置。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad692.
3
Ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses.幽灵树:用于多样性分析的杂种基因系统发育树的构建。
Microbiome. 2016 Feb 24;4:11. doi: 10.1186/s40168-016-0153-6.
4
A general species delimitation method with applications to phylogenetic placements.一种通用的物种界定方法及其在系统发育定位中的应用。
Bioinformatics. 2013 Nov 15;29(22):2869-76. doi: 10.1093/bioinformatics/btt499. Epub 2013 Aug 29.
5
Methods for automatic reference trees and multilevel phylogenetic placement.自动引用树和多层次系统发育定位方法。
Bioinformatics. 2019 Apr 1;35(7):1151-1158. doi: 10.1093/bioinformatics/bty767.
6
SEPP: SATé-enabled phylogenetic placement.SEPP:基于SATé的系统发育定位
Pac Symp Biocomput. 2012:247-58. doi: 10.1142/9789814366496_0024.
7
Phylogenetic double placement of mixed samples.混合样本的系统发育双重定位。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i335-i343. doi: 10.1093/bioinformatics/btaa489.
8
EPA-ng: Massively Parallel Evolutionary Placement of Genetic Sequences.EPA-ng:大规模并行遗传序列布局进化。
Syst Biol. 2019 Mar 1;68(2):365-369. doi: 10.1093/sysbio/syy054.
9
On the quality of tree-based protein classification.论基于树的蛋白质分类的质量。
Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.
10
Fast and accurate distance-based phylogenetic placement using divide and conquer.基于划分与征服的快速准确基于距离的系统发育定位方法
Mol Ecol Resour. 2022 Apr;22(3):1213-1227. doi: 10.1111/1755-0998.13527. Epub 2021 Oct 26.

引用本文的文献

1
Scalable method for exploring phylogenetic placement uncertainty with custom visualizations using and .使用[具体工具1]和[具体工具2]通过自定义可视化探索系统发育位置不确定性的可扩展方法。
Imeta. 2025 Jan 12;4(1):e269. doi: 10.1002/imt2.269. eCollection 2025 Feb.
2
Functional diversity of bacterial microbiota associated with the toxigenic benthic dinoflagellate Prorocentrum.与产毒底栖甲藻夜光藻相关的细菌微生物群落的功能多样性。
PLoS One. 2024 Jul 16;19(7):e0306108. doi: 10.1371/journal.pone.0306108. eCollection 2024.
3
EPIK: precise and scalable evolutionary placement with informative k-mers.

本文引用的文献

1
Genesis and Gappa: processing, analyzing and visualizing phylogenetic (placement) data.Genesis 和 Gappa:处理、分析和可视化系统发育(位置)数据。
Bioinformatics. 2020 May 1;36(10):3263-3265. doi: 10.1093/bioinformatics/btaa070.
2
Long-read metabarcoding of the eukaryotic rDNA operon to phylogenetically and taxonomically resolve environmental diversity.长读元条形码技术解析真核 rDNA 操纵子以解决环境多样性的系统发育和分类学问题。
Mol Ecol Resour. 2020 Mar;20(2):429-443. doi: 10.1111/1755-0998.13117. Epub 2019 Nov 29.
3
Scalable methods for analyzing and visualizing phylogenetic placement of metagenomic samples.
EPIK:带信息 k- -mer 的精确可扩展进化放置。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad692.
4
App-SpaM: phylogenetic placement of short reads without sequence alignment.App-SpaM:无需序列比对的短读段系统发育定位
Bioinform Adv. 2021 Oct 13;1(1):vbab027. doi: 10.1093/bioadv/vbab027. eCollection 2021.
5
Metagenomic Analysis Using Phylogenetic Placement-A Review of the First Decade.基于系统发育定位的宏基因组分析——首个十年综述
Front Bioinform. 2022 May 26;2:871393. doi: 10.3389/fbinf.2022.871393. eCollection 2022.
可扩展的方法来分析和可视化宏基因组样本的系统发育定位。
PLoS One. 2019 May 28;14(5):e0217050. doi: 10.1371/journal.pone.0217050. eCollection 2019.
4
RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference.RAxML-NG:用于最大似然系统发育推断的快速、可扩展和用户友好的工具。
Bioinformatics. 2019 Nov 1;35(21):4453-4455. doi: 10.1093/bioinformatics/btz305.
5
ParGenes: a tool for massively parallel model selection and phylogenetic tree inference on thousands of genes.ParGenes:一个用于在数千个基因上进行大规模并行模型选择和系统发育树推断的工具。
Bioinformatics. 2019 May 15;35(10):1771-1773. doi: 10.1093/bioinformatics/bty839.
6
EPA-ng: Massively Parallel Evolutionary Placement of Genetic Sequences.EPA-ng:大规模并行遗传序列布局进化。
Syst Biol. 2019 Mar 1;68(2):365-369. doi: 10.1093/sysbio/syy054.
7
Parasites dominate hyperdiverse soil protist communities in Neotropical rainforests.寄生虫在新热带雨林中高度多样化的土壤原生生物群落中占主导地位。
Nat Ecol Evol. 2017 Mar 20;1(4):91. doi: 10.1038/s41559-017-0091.
8
UniEuk: Time to Speak a Common Language in Protistology!单真核生物:原生生物学领域需要统一术语!
J Eukaryot Microbiol. 2017 May;64(3):407-411. doi: 10.1111/jeu.12414. Epub 2017 Apr 21.
9
Multi-rate Poisson tree processes for single-locus species delimitation under maximum likelihood and Markov chain Monte Carlo.基于最大似然法和马尔可夫链蒙特卡罗法的单基因座物种界定的多速率泊松树过程
Bioinformatics. 2017 Jun 1;33(11):1630-1638. doi: 10.1093/bioinformatics/btx025.
10
VSEARCH: a versatile open source tool for metagenomics.VSEARCH:一款用于宏基因组学的多功能开源工具。
PeerJ. 2016 Oct 18;4:e2584. doi: 10.7717/peerj.2584. eCollection 2016.