• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于去除宿主污染的短读长宏基因组学工具的基准测试

Benchmarking short-read metagenomics tools for removing host contamination.

作者信息

Yunyun Gao, Hao Luo, Hujie Lyu, Haifei Yang, Salsabeel Yousuf, Shi Huang, Yong-Xin Liu

机构信息

Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China.

Department of Life Sciences, Imperial College of London, London SW7 2AZ, UK.

出版信息

Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf004.

DOI:10.1093/gigascience/giaf004
PMID:40036691
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11878760/
Abstract

BACKGROUND

The rapid evolution of metagenomic sequencing technology offers remarkable opportunities to explore the intricate roles of microbiome in host health and disease, as well as to uncover the unknown structure and functions of microbial communities. However, the swift accumulation of metagenomic data poses substantial challenges for data analysis. Contamination from host DNA can substantially compromise result accuracy and increase additional computational resources by including nontarget sequences.

RESULTS

In this study, we assessed the impact of computational host DNA decontamination on downstream analyses, highlighting its importance in producing accurate results efficiently. We also evaluated the performance of conventional tools like KneadData, Bowtie2, BWA, KMCP, Kraken2, and KrakenUniq, each offering unique advantages for different applications. Furthermore, we highlighted the importance of an accurate host reference genome, noting that its absence negatively affected the decontamination performance across all tools.

CONCLUSIONS

Our findings underscore the need for careful selection of decontamination tools and reference genomes to enhance the accuracy of metagenomic analyses. These insights provide valuable guidance for improving the reliability and reproducibility of microbiome research.

摘要

背景

宏基因组测序技术的快速发展为探索微生物组在宿主健康和疾病中的复杂作用以及揭示微生物群落未知的结构和功能提供了显著机遇。然而,宏基因组数据的迅速积累给数据分析带来了巨大挑战。宿主DNA污染会通过纳入非目标序列严重损害结果准确性并增加额外的计算资源。

结果

在本研究中,我们评估了计算去除宿主DNA对下游分析的影响,强调了其在高效产生准确结果方面的重要性。我们还评估了KneadData、Bowtie2、BWA、KMCP、Kraken2和KrakenUniq等传统工具的性能,每个工具在不同应用中都具有独特优势。此外,我们强调了准确的宿主参考基因组的重要性,指出其缺失会对所有工具的去除污染性能产生负面影响。

结论

我们的研究结果强调了仔细选择去除污染工具和参考基因组以提高宏基因组分析准确性的必要性。这些见解为提高微生物组研究的可靠性和可重复性提供了有价值的指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6365/11878760/3a5e6585813d/giaf004fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6365/11878760/0dcfa77227a3/giaf004fig1g.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6365/11878760/c21802cf0f35/giaf004fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6365/11878760/6509c76f8a67/giaf004fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6365/11878760/17dd4fa099ec/giaf004fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6365/11878760/3a5e6585813d/giaf004fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6365/11878760/0dcfa77227a3/giaf004fig1g.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6365/11878760/c21802cf0f35/giaf004fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6365/11878760/6509c76f8a67/giaf004fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6365/11878760/17dd4fa099ec/giaf004fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6365/11878760/3a5e6585813d/giaf004fig4.jpg

相似文献

1
Benchmarking short-read metagenomics tools for removing host contamination.用于去除宿主污染的短读长宏基因组学工具的基准测试
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf004.
2
An integrated strain-level analytic pipeline utilizing longitudinal metagenomic data.利用纵向宏基因组数据的综合菌株水平分析管道。
Microbiol Spectr. 2024 Nov 5;12(11):e0143124. doi: 10.1128/spectrum.01431-24. Epub 2024 Sep 23.
3
Benchmarking genome assembly methods on metagenomic sequencing data.基于宏基因组测序数据对基因组组装方法进行基准测试。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad087.
4
HoCoRT: host contamination removal tool.HoCoRT:宿主污染去除工具。
BMC Bioinformatics. 2023 Oct 2;24(1):371. doi: 10.1186/s12859-023-05492-w.
5
Recentrifuge: Robust comparative analysis and contamination removal for metagenomics.Recentrifuge:用于宏基因组学的稳健比较分析和污染去除。
PLoS Comput Biol. 2019 Apr 8;15(4):e1006967. doi: 10.1371/journal.pcbi.1006967. eCollection 2019 Apr.
6
Unveiling microbial diversity: harnessing long-read sequencing technology.揭示微生物多样性:利用长读长测序技术
Nat Methods. 2024 Jun;21(6):954-966. doi: 10.1038/s41592-024-02262-1. Epub 2024 Apr 30.
7
Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.通过验证的视角看宏基因组组装:评估和提高宏基因组组装基因组质量的最新进展。
Brief Bioinform. 2019 Jul 19;20(4):1140-1150. doi: 10.1093/bib/bbx098.
8
LEMMI: a continuous benchmarking platform for metagenomics classifiers.LEMMI:用于宏基因组分类器的连续基准测试平台。
Genome Res. 2020 Aug;30(8):1208-1216. doi: 10.1101/gr.260398.119. Epub 2020 Jul 2.
9
CAIM: coverage-based analysis for identification of microbiome.CAIM:基于覆盖度的微生物组分析方法。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae424.
10
Benchmarking second and third-generation sequencing platforms for microbial metagenomics.用于微生物宏基因组学的第二代和第三代测序平台的基准测试。
Sci Data. 2022 Nov 11;9(1):694. doi: 10.1038/s41597-022-01762-z.

引用本文的文献

1
Metagenomics reveals fibre fermentation and AMR pathways in red grouse (Lagopus scotica) microbiota.宏基因组学揭示了红松鸡(Lagopus scotica)微生物群中的纤维发酵和抗生素抗性途径。
BMC Microbiol. 2025 Aug 19;25(1):520. doi: 10.1186/s12866-025-04280-1.
2
Dual associations of gut and oral microbial networks with kidney transplantation.肠道和口腔微生物网络与肾移植的双重关联
mSystems. 2025 Aug 19;10(8):e0025225. doi: 10.1128/msystems.00252-25. Epub 2025 Jul 9.

本文引用的文献

1
EasyMetagenome: A user-friendly and flexible pipeline for shotgun metagenomic analysis in microbiome research.简易宏基因组:微生物组研究中用于鸟枪法宏基因组分析的用户友好型灵活流程。
Imeta. 2025 Feb 14;4(1):e70001. doi: 10.1002/imt2.70001. eCollection 2025 Feb.
2
Impact of database choice and confidence score on the performance of taxonomic classification using Kraken2.数据库选择和置信度分数对使用Kraken2进行分类学分类性能的影响。
aBIOTECH. 2024 Jul 31;5(4):465-475. doi: 10.1007/s42994-024-00178-0. eCollection 2024 Dec.
3
The Microbiome Protocols eBook initiative: Building a bridge to microbiome research.
微生物组协议电子书计划:搭建通往微生物组研究的桥梁。
Imeta. 2024 Mar 19;3(2):e182. doi: 10.1002/imt2.182. eCollection 2024 Apr.
4
EasyAmplicon: An easy-to-use, open-source, reproducible, and community-based pipeline for amplicon data analysis in microbiome research.EasyAmplicon:一种易于使用、开源、可重复且基于社区的流程,用于微生物组研究中的扩增子数据分析。
Imeta. 2023 Jan 27;2(1):e83. doi: 10.1002/imt2.83. eCollection 2023 Feb.
5
MultiPrime: A reliable and efficient tool for targeted next-generation sequencing.多重引物法:一种用于靶向新一代测序的可靠且高效的工具。
Imeta. 2023 Oct 19;2(4):e143. doi: 10.1002/imt2.143. eCollection 2023 Nov.
6
Hostile: accurate decontamination of microbial host sequences.宿主敌对:微生物宿主序列的精确净化。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad728.
7
Microbial-enrichment method enables high-throughput metagenomic characterization from host-rich samples.微生物富集方法可实现宿主相关样本的高通量宏基因组特征分析。
Nat Methods. 2023 Nov;20(11):1672-1682. doi: 10.1038/s41592-023-02025-4. Epub 2023 Oct 12.
8
Major data analysis errors invalidate cancer microbiome findings.主要数据分析错误使癌症微生物组研究结果无效。
mBio. 2023 Oct 31;14(5):e0160723. doi: 10.1128/mbio.01607-23. Epub 2023 Oct 9.
9
HoCoRT: host contamination removal tool.HoCoRT:宿主污染去除工具。
BMC Bioinformatics. 2023 Oct 2;24(1):371. doi: 10.1186/s12859-023-05492-w.
10
CheckM2: a rapid, scalable and accurate tool for assessing microbial genome quality using machine learning.CheckM2:一种使用机器学习快速、可扩展且准确评估微生物基因组质量的工具。
Nat Methods. 2023 Aug;20(8):1203-1212. doi: 10.1038/s41592-023-01940-w. Epub 2023 Jul 27.