宏基因组分析工具的准确性和速度评估。

An evaluation of the accuracy and speed of metagenome analysis tools.

作者信息

Lindgreen Stinus, Adair Karen L, Gardner Paul P

机构信息

Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand.

School of Biological Sciences, University of Canterbury, Christchurch, New Zealand.

出版信息

Sci Rep. 2016 Jan 18;6:19233. doi: 10.1038/srep19233.

DOI:10.1038/srep19233

PMID:26778510

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4726098/

Abstract

Metagenome studies are becoming increasingly widespread, yielding important insights into microbial communities covering diverse environments from terrestrial and aquatic ecosystems to human skin and gut. With the advent of high-throughput sequencing platforms, the use of large scale shotgun sequencing approaches is now commonplace. However, a thorough independent benchmark comparing state-of-the-art metagenome analysis tools is lacking. Here, we present a benchmark where the most widely used tools are tested on complex, realistic data sets. Our results clearly show that the most widely used tools are not necessarily the most accurate, that the most accurate tool is not necessarily the most time consuming, and that there is a high degree of variability between available tools. These findings are important as the conclusions of any metagenomics study are affected by errors in the predicted community composition and functional capacity. Data sets and results are freely available from http://www.ucbioinformatics.org/metabenchmark.html.

摘要

宏基因组研究正变得越来越普遍，为微生物群落提供了重要见解，这些群落涵盖了从陆地和水生生态系统到人类皮肤和肠道等各种环境。随着高通量测序平台的出现，大规模鸟枪法测序方法的使用现在已经很常见。然而，缺乏对最先进的宏基因组分析工具进行全面独立的基准测试。在这里，我们展示了一个基准测试，其中最广泛使用的工具在复杂、现实的数据集上进行了测试。我们的结果清楚地表明，使用最广泛的工具不一定是最准确的，最准确的工具不一定是最耗时的，并且现有工具之间存在高度变异性。这些发现很重要，因为任何宏基因组学研究的结论都会受到预测群落组成和功能能力错误的影响。数据集和结果可从http://www.ucbioinformatics.org/metabenchmark.html免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a627/4726098/c7fc208ecc5f/srep19233-f1.jpg

相似文献

An evaluation of the accuracy and speed of metagenome analysis tools.

Sci Rep. 2016 Jan 18;6:19233. doi: 10.1038/srep19233.

CAMISIM: simulating metagenomes and microbial communities.

Microbiome. 2019 Feb 8;7(1):17. doi: 10.1186/s40168-019-0633-6.

Practical considerations for sampling and data analysis in contemporary metagenomics-based environmental studies.

J Microbiol Methods. 2018 Nov;154:14-18. doi: 10.1016/j.mimet.2018.09.020. Epub 2018 Oct 1.

Quantitative Assessment of Shotgun Metagenomics and 16S rDNA Amplicon Sequencing in the Study of Human Gut Microbiome.

OMICS. 2018 Apr;22(4):248-254. doi: 10.1089/omi.2018.0013.

Machine Learning Meta-analysis of Large Metagenomic Datasets: Tools and Biological Insights.

PLoS Comput Biol. 2016 Jul 11;12(7):e1004977. doi: 10.1371/journal.pcbi.1004977. eCollection 2016 Jul.

Bioinformatics for NGS-based metagenomics and the application to biogas research.

J Biotechnol. 2017 Nov 10;261:10-23. doi: 10.1016/j.jbiotec.2017.08.012. Epub 2017 Aug 18.

Microbial community analysis using high-throughput sequencing technology: a beginner's guide for microbiologists.

J Microbiol. 2020 Mar;58(3):176-192. doi: 10.1007/s12275-020-9525-5. Epub 2020 Feb 27.

Species classifier choice is a key consideration when analysing low-complexity food microbiome data.

Microbiome. 2018 Mar 20;6(1):50. doi: 10.1186/s40168-018-0437-0.

MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics.

BMC Bioinformatics. 2015 Feb 28;16(1):65. doi: 10.1186/s12859-015-0501-8.

An Agile Functional Analysis of Metagenomic Data Using SUPER-FOCUS.

Methods Mol Biol. 2017;1611:35-44. doi: 10.1007/978-1-4939-7015-5_4.

引用本文的文献

Impact of Parenteral Ceftiofur on Developmental Dynamics of Early Life Fecal Microbiota and Antibiotic Resistome in Neonatal Lambs.

Antibiotics (Basel). 2025 Apr 25;14(5):434. doi: 10.3390/antibiotics14050434.

Diversified Soil Types Differentially Regulated the Peanut ( L.) Growth and Rhizosphere Bacterial Community Structure.

Plants (Basel). 2025 Apr 9;14(8):1169. doi: 10.3390/plants14081169.

Bracken: estimating species abundance in metagenomics data.

PeerJ Comput Sci. 2017;3. doi: 10.7717/peerj-cs.104. Epub 2017 Jan 2.

A review of neural networks for metagenomic binning.

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf065.

MetaComBin: combining abundances and overlaps for binning metagenomics reads.

Front Bioinform. 2025 Mar 3;5:1504728. doi: 10.3389/fbinf.2025.1504728. eCollection 2025.

Impact of simulation and reference catalogues on the evaluation of taxonomic profiling pipelines.

Microb Genom. 2025 Jan;11(1). doi: 10.1099/mgen.0.001330.

Assessing the de novo assemblers: a metaviromic study of apple and first report of citrus concave gum-associated virus, apple rubbery wood virus 1 and 2 infecting apple in India.

BMC Genomics. 2024 Nov 8;25(1):1057. doi: 10.1186/s12864-024-10968-x.

Design and implementation of a metagenomic analytical pipeline for respiratory pathogen detection.

BMC Res Notes. 2024 Oct 3;17(1):291. doi: 10.1186/s13104-024-06964-9.

Taxanorm: a novel taxa-specific normalization approach for microbiome data.

BMC Bioinformatics. 2024 Sep 16;25(1):304. doi: 10.1186/s12859-024-05918-z.

CAIM: coverage-based analysis for identification of microbiome.

Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae424.

本文引用的文献

CLARK: fast and accurate classification of metagenomic and genomic sequences using discriminative k-mers.

BMC Genomics. 2015 Mar 25;16(1):236. doi: 10.1186/s12864-015-1419-2.

Accurate read-based metagenome characterization using a hierarchical suite of unique signatures.

Nucleic Acids Res. 2015 May 26;43(10):e69. doi: 10.1093/nar/gkv180. Epub 2015 Mar 12.

Ancient and modern environmental DNA.

Philos Trans R Soc Lond B Biol Sci. 2015 Jan 19;370(1660):20130383. doi: 10.1098/rstb.2013.0383.

Metatranscriptome profiling of a harmful algal bloom.

Harmful Algae. 2014 Jul;37:75-83. doi: 10.1016/j.hal.2014.04.016.

Seeing the forest for the genes: using metagenomics to infer the aggregated traits of microbial communities.

Front Microbiol. 2014 Nov 12;5:614. doi: 10.3389/fmicb.2014.00614. eCollection 2014.

Fast and sensitive protein alignment using DIAMOND.

Nat Methods. 2015 Jan;12(1):59-60. doi: 10.1038/nmeth.3176. Epub 2014 Nov 17.

Rfam 12.0: updates to the RNA families database.

Nucleic Acids Res. 2015 Jan;43(Database issue):D130-7. doi: 10.1093/nar/gku1063. Epub 2014 Nov 11.

Taxator-tk: precise taxonomic assignment of metagenomes by fast approximation of evolutionary neighborhoods.

Bioinformatics. 2015 Mar 15;31(6):817-24. doi: 10.1093/bioinformatics/btu745. Epub 2014 Nov 10.

Relating the metatranscriptome and metagenome of the human gut.

Proc Natl Acad Sci U S A. 2014 Jun 3;111(22):E2329-38. doi: 10.1073/pnas.1319284111. Epub 2014 May 19.

Trimmomatic: a flexible trimmer for Illumina sequence data.

Bioinformatics. 2014 Aug 1;30(15):2114-20. doi: 10.1093/bioinformatics/btu170. Epub 2014 Apr 1.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

宏基因组分析工具的准确性和速度评估。

An evaluation of the accuracy and speed of metagenome analysis tools.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献