使用分层独特特征套件进行基于读取的准确宏基因组表征。

Accurate read-based metagenome characterization using a hierarchical suite of unique signatures.

作者信息

Freitas Tracey Allen K, Li Po-E, Scholz Matthew B, Chain Patrick S G

机构信息

Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.

Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA

出版信息

Nucleic Acids Res. 2015 May 26;43(10):e69. doi: 10.1093/nar/gkv180. Epub 2015 Mar 12.

DOI:10.1093/nar/gkv180

PMID:25765641

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4446416/

Abstract

A major challenge in the field of shotgun metagenomics is the accurate identification of organisms present within a microbial community, based on classification of short sequence reads. Though existing microbial community profiling methods have attempted to rapidly classify the millions of reads output from modern sequencers, the combination of incomplete databases, similarity among otherwise divergent genomes, errors and biases in sequencing technologies, and the large volumes of sequencing data required for metagenome sequencing has led to unacceptably high false discovery rates (FDR). Here, we present the application of a novel, gene-independent and signature-based metagenomic taxonomic profiling method with significantly and consistently smaller FDR than any other available method. Our algorithm circumvents false positives using a series of non-redundant signature databases and examines Genomic Origins Through Taxonomic CHAllenge (GOTTCHA). GOTTCHA was tested and validated on 20 synthetic and mock datasets ranging in community composition and complexity, was applied successfully to data generated from spiked environmental and clinical samples, and robustly demonstrates superior performance compared with other available tools.

摘要

鸟枪法宏基因组学领域的一个主要挑战是，基于短序列 reads 的分类，准确识别微生物群落中存在的生物体。尽管现有的微生物群落分析方法试图快速对现代测序仪输出的数百万条 reads 进行分类，但不完整的数据库、不同基因组之间的相似性、测序技术中的错误和偏差，以及宏基因组测序所需的大量测序数据，导致了高得令人无法接受的错误发现率（FDR）。在这里，我们展示了一种新颖的、基于基因独立和特征的宏基因组分类分析方法的应用，该方法的 FDR 显著且始终低于任何其他现有方法。我们的算法使用一系列非冗余特征数据库规避假阳性，并通过分类挑战检验基因组起源（GOTTCHA）。GOTTCHA 在 20 个合成和模拟数据集上进行了测试和验证，这些数据集的群落组成和复杂性各不相同，并成功应用于加标环境和临床样本生成的数据，与其他现有工具相比，有力地证明了其卓越的性能。

相似文献

Accurate read-based metagenome characterization using a hierarchical suite of unique signatures.使用分层独特特征套件进行基于读取的准确宏基因组表征。

Nucleic Acids Res. 2015 May 26;43(10):e69. doi: 10.1093/nar/gkv180. Epub 2015 Mar 12.

MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.环境宏基因组的MinION™纳米孔测序：一种合成方法。

Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.

Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets.评价长读 shotgun 宏基因组测序数据集的分类和分析方法。

BMC Bioinformatics. 2022 Dec 13;23(1):541. doi: 10.1186/s12859-022-05103-0.

CAIM: coverage-based analysis for identification of microbiome.CAIM：基于覆盖度的微生物组分析方法。

Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae424.

Sketching and sampling approaches for fast and accurate long read classification.快速准确的长读分类的草图和采样方法。

BMC Bioinformatics. 2022 Oct 31;23(1):452. doi: 10.1186/s12859-022-05014-0.

RAIphy: phylogenetic classification of metagenomics samples using iterative refinement of relative abundance index profiles.RAIphy：基于相对丰度指数轮廓的迭代细化对宏基因组样本进行系统发育分类。

BMC Bioinformatics. 2011 Jan 31;12:41. doi: 10.1186/1471-2105-12-41.

Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.通过验证的视角看宏基因组组装：评估和提高宏基因组组装基因组质量的最新进展。

Brief Bioinform. 2019 Jul 19;20(4):1140-1150. doi: 10.1093/bib/bbx098.

Species classifier choice is a key consideration when analysing low-complexity food microbiome data.在分析低复杂度食品微生物组数据时，物种分类器的选择是一个关键考虑因素。

Microbiome. 2018 Mar 20;6(1):50. doi: 10.1186/s40168-018-0437-0.

Filtration and Normalization of Sequencing Read Data in Whole-Metagenome Shotgun Samples.全基因组鸟枪法样本中测序读段数据的过滤与标准化

PLoS One. 2016 Oct 19;11(10):e0165015. doi: 10.1371/journal.pone.0165015. eCollection 2016.

CAMISIM: simulating metagenomes and microbial communities.CAMISIM：模拟宏基因组和微生物群落。

Microbiome. 2019 Feb 8;7(1):17. doi: 10.1186/s40168-019-0633-6.

引用本文的文献

Advancing metagenomic classification with NABAS+: a novel alignment-based approach.使用NABAS+推进宏基因组分类：一种基于比对的新方法。

NAR Genom Bioinform. 2025 Jul 4;7(3):lqaf092. doi: 10.1093/nargab/lqaf092. eCollection 2025 Sep.

Addressing the dynamic nature of reference data: a new nucleotide database for robust metagenomic classification.应对参考数据的动态特性：一个用于可靠宏基因组分类的新核苷酸数据库。

mSystems. 2025 Apr 22;10(4):e0123924. doi: 10.1128/msystems.01239-24. Epub 2025 Mar 20.

Standardized and accessible multi-omics bioinformatics workflows through the NMDC EDGE resource.通过NMDC EDGE资源实现标准化且可访问的多组学生物信息学工作流程。

Comput Struct Biotechnol J. 2024 Sep 27;23:3575-3583. doi: 10.1016/j.csbj.2024.09.018. eCollection 2024 Dec.

Metagenomic profiling of nasopharyngeal samples from adults with acute respiratory infection.对患有急性呼吸道感染的成年人鼻咽样本进行宏基因组分析。

R Soc Open Sci. 2024 Jul 10;11(7):240108. doi: 10.1098/rsos.240108. eCollection 2024 Jul.

Combining compositional data sets introduces error in covariance network reconstruction.合并成分数据集会在协方差网络重建中引入误差。

ISME Commun. 2024 Apr 19;4(1):ycae057. doi: 10.1093/ismeco/ycae057. eCollection 2024 Jan.

Genomic fingerprints of the world's soil ecosystems.世界土壤生态系统的基因组指纹图谱。

mSystems. 2024 Jun 18;9(6):e0111223. doi: 10.1128/msystems.01112-23. Epub 2024 May 9.

Sea cucumber () intestinal microbiome dataset from Puerto Rico, generated by shotgun sequencing.来自波多黎各的海参肠道微生物组数据集，通过鸟枪法测序生成。

Data Brief. 2024 Apr 15;54:110421. doi: 10.1016/j.dib.2024.110421. eCollection 2024 Jun.

Correlation between the gut microbiome and neurodegenerative diseases: a review of metagenomics evidence.肠道微生物群与神经退行性疾病之间的关联：宏基因组学证据综述

Neural Regen Res. 2024 Apr;19(4):833-845. doi: 10.4103/1673-5374.382223.

Complex organic matter degradation by secondary consumers in chemolithoautotrophy-based subsurface geothermal ecosystems.次生消费者在基于化能自养的地下地热生态系统中对复杂有机物的降解。

PLoS One. 2023 Aug 18;18(8):e0281277. doi: 10.1371/journal.pone.0281277. eCollection 2023.

Comparing variability in diagnosis of upper respiratory tract infections in patients using syndromic, next generation sequencing, and PCR-based methods.比较使用症状诊断法、下一代测序法和基于聚合酶链反应的方法对患者上呼吸道感染进行诊断时的变异性。

PLOS Glob Public Health. 2022 Jul 20;2(7):e0000811. doi: 10.1371/journal.pgph.0000811. eCollection 2022.

本文引用的文献

Metagenomic species profiling using universal phylogenetic marker genes.基于通用系统发育标记基因的宏基因组物种分析。

Nat Methods. 2013 Dec;10(12):1196-9. doi: 10.1038/nmeth.2693. Epub 2013 Oct 20.

The origin of biased sequence depth in sequence-independent nucleic acid amplification and optimization for efficient massive parallel sequencing.序列非依赖性核酸扩增中偏倚序列深度的起源及高效大规模平行测序的优化。

PLoS One. 2013 Sep 26;8(9):e76144. doi: 10.1371/journal.pone.0076144. eCollection 2013.

Comparison of DNA extraction methods in analysis of salivary bacterial communities.比较唾液细菌群落分析中 DNA 提取方法。

PLoS One. 2013 Jul 3;8(7):e67699. doi: 10.1371/journal.pone.0067699. Print 2013.

Kraken: a set of tools for quality control and analysis of high-throughput sequence data.Kraken：一组用于高通量测序数据质量控制和分析的工具。

Methods. 2013 Sep 1;63(1):41-9. doi: 10.1016/j.ymeth.2013.06.027. Epub 2013 Jun 29.

Random sampling process leads to overestimation of β-diversity of microbial communities.随机抽样过程导致微生物群落β多样性的高估。

mBio. 2013 Jun 11;4(3):e00324-13. doi: 10.1128/mBio.00324-13.

Benchmarking short sequence mapping tools.短序列比对工具的基准测试。

BMC Bioinformatics. 2013 Jun 7;14:184. doi: 10.1186/1471-2105-14-184.

Sequencing platform and library preparation choices impact viral metagenomes.测序平台和文库制备方案的选择会影响病毒宏基因组。

BMC Genomics. 2013 May 10;14:320. doi: 10.1186/1471-2164-14-320.

Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms.基于组合的短宏基因组序列分类阐明了微生物分类和功能丰富度的景观。

Nucleic Acids Res. 2013 Jan 7;41(1):e3. doi: 10.1093/nar/gks828. Epub 2012 Aug 31.

Rapid phylogenetic and functional classification of short genomic fragments with signature peptides.利用特征肽对短基因组片段进行快速系统发育和功能分类。

BMC Res Notes. 2012 Aug 28;5:460. doi: 10.1186/1756-0500-5-460.

Novel bacterial taxa in the human microbiome.人类微生物组中的新型细菌分类群。

PLoS One. 2012;7(6):e35294. doi: 10.1371/journal.pone.0035294. Epub 2012 Jun 13.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用分层独特特征套件进行基于读取的准确宏基因组表征。

Accurate read-based metagenome characterization using a hierarchical suite of unique signatures.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献