宏基因组解读的批判性评估——宏基因组学软件的一项基准测试

Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software.

作者信息

Sczyrba Alexander, Hofmann Peter, Belmann Peter, Koslicki David, Janssen Stefan, Dröge Johannes, Gregor Ivan, Majda Stephan, Fiedler Jessika, Dahms Eik, Bremges Andreas, Fritz Adrian, Garrido-Oter Ruben, Jørgensen Tue Sparholt, Shapiro Nicole, Blood Philip D, Gurevich Alexey, Bai Yang, Turaev Dmitrij, DeMaere Matthew Z, Chikhi Rayan, Nagarajan Niranjan, Quince Christopher, Meyer Fernando, Balvočiūtė Monika, Hansen Lars Hestbjerg, Sørensen Søren J, Chia Burton K H, Denis Bertrand, Froula Jeff L, Wang Zhong, Egan Robert, Don Kang Dongwan, Cook Jeffrey J, Deltel Charles, Beckstette Michael, Lemaitre Claire, Peterlongo Pierre, Rizk Guillaume, Lavenier Dominique, Wu Yu-Wei, Singer Steven W, Jain Chirag, Strous Marc, Klingenberg Heiner, Meinicke Peter, Barton Michael D, Lingner Thomas, Lin Hsin-Hung, Liao Yu-Chieh, Silva Genivaldo Gueiros Z, Cuevas Daniel A, Edwards Robert A, Saha Surya, Piro Vitor C, Renard Bernhard Y, Pop Mihai, Klenk Hans-Peter, Göker Markus, Kyrpides Nikos C, Woyke Tanja, Vorholt Julia A, Schulze-Lefert Paul, Rubin Edward M, Darling Aaron E, Rattei Thomas, McHardy Alice C

机构信息

Faculty of Technology, Bielefeld University, Bielefeld, Germany.

Center for Biotechnology, Bielefeld University, Bielefeld, Germany.

出版信息

Nat Methods. 2017 Nov;14(11):1063-1071. doi: 10.1038/nmeth.4458. Epub 2017 Oct 2.

DOI:10.1038/nmeth.4458

PMID:28967888

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5903868/

Abstract

Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on highly complex and realistic data sets, generated from ∼700 newly sequenced microorganisms and ∼600 novel viruses and plasmids and representing common experimental setups. Assembly and genome binning programs performed well for species represented by individual genomes but were substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below family level. Parameter settings markedly affected performance, underscoring their importance for program reproducibility. The CAMI results highlight current challenges but also provide a roadmap for software selection to answer specific research questions.

摘要

组装、分类学分析和分箱方法是解释宏基因组数据的关键，但缺乏关于基准测试的共识使性能评估变得复杂。宏基因组解释关键评估（CAMI）挑战赛促使全球开发者社区在高度复杂且逼真的数据集上对其程序进行基准测试，这些数据集由约700种新测序的微生物以及约600种新型病毒和质粒生成，并代表常见的实验设置。组装和基因组分箱程序对于由单个基因组代表的物种表现良好，但受到相关菌株的显著影响。分类学分析和分箱程序在高分类级别上表现出色，在科级以下性能显著下降。参数设置显著影响性能，突出了它们对程序可重复性的重要性。CAMI结果突出了当前的挑战，但也为选择软件以回答特定研究问题提供了路线图。

相似文献

Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software.

Nat Methods. 2017 Nov;14(11):1063-1071. doi: 10.1038/nmeth.4458. Epub 2017 Oct 2.

Critical Assessment of Metagenome Interpretation: the second round of challenges.

Nat Methods. 2022 Apr;19(4):429-440. doi: 10.1038/s41592-022-01431-4. Epub 2022 Apr 8.

CAMISIM: simulating metagenomes and microbial communities.

Microbiome. 2019 Feb 8;7(1):17. doi: 10.1186/s40168-019-0633-6.

Tutorial: assessing metagenomics software with the CAMI benchmarking toolkit.

Nat Protoc. 2021 Apr;16(4):1785-1801. doi: 10.1038/s41596-020-00480-3. Epub 2021 Mar 1.

CAMI Benchmarking Portal: online evaluation and ranking of metagenomic software.

Nucleic Acids Res. 2025 Jul 7;53(W1):W102-W109. doi: 10.1093/nar/gkaf369.

Evaluating metagenomics tools for genome binning with real metagenomic datasets and CAMI datasets.

BMC Bioinformatics. 2020 Jul 28;21(1):334. doi: 10.1186/s12859-020-03667-3.

AMBER: Assessment of Metagenome BinnERs.

Gigascience. 2018 Jun 1;7(6). doi: 10.1093/gigascience/giy069.

Tamock: simulation of habitat-specific benchmark data in metagenomics.

BMC Bioinformatics. 2021 May 1;22(1):227. doi: 10.1186/s12859-021-04154-z.

Benchmarking different approaches for Norovirus genome assembly in metagenome samples.

BMC Genomics. 2021 Nov 24;22(1):849. doi: 10.1186/s12864-021-08067-2.

Optimizing and evaluating the reconstruction of Metagenome-assembled microbial genomes.

BMC Genomics. 2017 Nov 28;18(1):915. doi: 10.1186/s12864-017-4294-1.

引用本文的文献

Long read metagenomics-based precise tracking of bacterial strains and genomic changes after fecal microbiota transplantation.

bioRxiv. 2025 Aug 11:2024.09.30.615906. doi: 10.1101/2024.09.30.615906.

Clinical metagenomics for diagnosis and surveillance of viral pathogens.

Nat Rev Microbiol. 2025 Aug 13. doi: 10.1038/s41579-025-01223-5.

Analysis of metagenomic data.

Nat Rev Methods Primers. 2025;5. doi: 10.1038/s43586-024-00376-6. Epub 2025 Jan 23.

Metagenomics-Toolkit: the flexible and efficient cloud-based metagenomics workflow featuring machine learning-enabled resource allocation.

NAR Genom Bioinform. 2025 Jul 17;7(3):lqaf093. doi: 10.1093/nargab/lqaf093. eCollection 2025 Sep.

ganon2: up-to-date and scalable metagenomics analysis.

NAR Genom Bioinform. 2025 Jul 17;7(3):lqaf094. doi: 10.1093/nargab/lqaf094. eCollection 2025 Sep.

Comprehensive taxonomic identification of microbial species in metagenomic data using SingleM and Sandpiper.

Nat Biotechnol. 2025 Jul 16. doi: 10.1038/s41587-025-02738-1.

Constructing inflammatory bowel disease diagnostic models based on k-mer and machine learning.

Front Microbiol. 2025 Jun 25;16:1578005. doi: 10.3389/fmicb.2025.1578005. eCollection 2025.

Advancing metagenomic classification with NABAS+: a novel alignment-based approach.

NAR Genom Bioinform. 2025 Jul 4;7(3):lqaf092. doi: 10.1093/nargab/lqaf092. eCollection 2025 Sep.

Eukaryotic composition across seasons and social groups in the gut microbiota of wild baboons.

Anim Microbiome. 2025 Jun 21;7(1):70. doi: 10.1186/s42523-025-00436-6.

ARGContextProfiler: extracting and scoring the genomic contexts of antibiotic resistance genes using assembly graphs.

Front Microbiol. 2025 May 21;16:1604461. doi: 10.3389/fmicb.2025.1604461. eCollection 2025.

本文引用的文献

SILVA, RDP, Greengenes, NCBI and OTT - how do these taxonomies compare?

BMC Genomics. 2017 Mar 14;18(Suppl 2):114. doi: 10.1186/s12864-017-3501-4.

MetaPalette: a -mer Painting Approach for Metagenomic Taxonomic Profiling and Quantification of Novel Strain Variation.

mSystems. 2016 Jun 7;1(3). doi: 10.1128/mSystems.00020-16. eCollection 2016 May-Jun.

IMG/M: integrated genome and metagenome comparative data analysis system.

Nucleic Acids Res. 2017 Jan 4;45(D1):D507-D516. doi: 10.1093/nar/gkw929. Epub 2016 Oct 13.

MEGAN Community Edition - Interactive Exploration and Analysis of Large-Scale Microbiome Sequencing Data.

PLoS Comput Biol. 2016 Jun 21;12(6):e1004957. doi: 10.1371/journal.pcbi.1004957. eCollection 2016 Jun.

Natural history of the infant gut microbiome and impact of antibiotic treatment on bacterial strain diversity and stability.

Sci Transl Med. 2016 Jun 15;8(343):343ra81. doi: 10.1126/scitranslmed.aad0917.

DUDes: a top-down taxonomic profiler for metagenomics.

Bioinformatics. 2016 Aug 1;32(15):2272-80. doi: 10.1093/bioinformatics/btw150. Epub 2016 Mar 24.

Durable coexistence of donor and recipient strains after fecal microbiota transplantation.

Science. 2016 Apr 29;352(6285):586-9. doi: 10.1126/science.aad8852.

Microbiology: the road to strain-level identification.

Nat Methods. 2016 Apr 28;13(5):401-4. doi: 10.1038/nmeth.3837.

High definition for systems biology of microbial communities: metagenomics gets genome-centric and strain-resolved.

Curr Opin Biotechnol. 2016 Jun;39:174-181. doi: 10.1016/j.copbio.2016.04.011. Epub 2016 Apr 23.

Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes.

Sci Rep. 2016 Apr 12;6:24175. doi: 10.1038/srep24175.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

宏基因组解读的批判性评估——宏基因组学软件的一项基准测试

Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software.

作者信息

机构信息

Faculty of Technology, Bielefeld University, Bielefeld, Germany.

Center for Biotechnology, Bielefeld University, Bielefeld, Germany.

出版信息

Nat Methods. 2017 Nov;14(11):1063-1071. doi: 10.1038/nmeth.4458. Epub 2017 Oct 2.

DOI:10.1038/nmeth.4458

PMID:28967888

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5903868/

Abstract

摘要

宏基因组解读的批判性评估——宏基因组学软件的一项基准测试

Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

宏基因组解读的批判性评估——宏基因组学软件的一项基准测试

Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software.

作者信息

机构信息

出版信息