• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CHM13-T2T基因组的使用通过最大限度减少宿主DNA污染改进了宏基因组分析。

Use of the CHM13-T2T genome improves metagenomic analysis by minimizing host DNA contamination.

作者信息

Liu Donglai, Hu Jinjun, Zhang Dan, Ren Shanshan, Zhao Lanqing, Gao Hongyan, Hu Songnian, Xu Sihong, Liang Guanxiang

机构信息

National Institutes for Food and Drug Control, Beijing, China.

Center for Infection Biology, School of Basic Medical Sciences, Tsinghua University, Beijing, China.

出版信息

mSystems. 2025 Sep 10:e0084025. doi: 10.1128/msystems.00840-25.

DOI:10.1128/msystems.00840-25
PMID:40928236
Abstract

Human-associated metagenomic data often contain human nucleic acid information, which can affect the accuracy of microbial classification or raise ethical concerns. These reads are typically removed through alignment to the human genome using various metagenomic mapping tools or human reference genomes, followed by filtration before metagenomic analysis. In this study, we conducted a comprehensive analysis to identify the optimal combination of alignment software and human reference genomes using benchmarking data. Our findings show that the combination of bwa-mem and the telomere-to-telomere human genome (CHM13-T2T) is the most effective in removing human reads in simulated data. We also analyzed CHM13-T2T-derived sequences in RefSeq to understand how CHM13-T2T reduces false positive results. Finally, we assessed clinical samples and found that CHM13-T2T effectively reduces host-derived contamination, particularly in low microbial biomass samples. This study provides a thorough overview of the application of CHM13-T2T in metagenomic analysis and highlights its significance in improving microbial classification accuracy.IMPORTANCEHuman gene sequences account for a large proportion of metagenomic sequences. To gain accurate and precise microbiome information, effective host-derived contamination removal methods are required. Both the alignment algorithm and the reference genome could influence the effectiveness of this process. The telomere-to-telomere human genome (CHM13-T2T) is a state-of-the-art human genome with 216 Mbp of additional new sequences compared with the commonly used GRCh38.p14. Our findings show the optimal dehosting effect of CHM13-T2T combined with the bwa-mem software in metagenomic analysis. We also investigate the reasons for the superiority of CHM13-T2T. Our study provides insights into optimal strategies for host sequence removal from metagenomic data. A standard reference is proposed for future metagenomic analysis, which can improve the accuracy of microbial identification.

摘要

与人类相关的宏基因组数据通常包含人类核酸信息,这可能会影响微生物分类的准确性或引发伦理问题。这些 reads 通常通过使用各种宏基因组映射工具或人类参考基因组与人类基因组进行比对来去除,然后在宏基因组分析之前进行过滤。在本研究中,我们使用基准数据进行了全面分析,以确定比对软件和人类参考基因组的最佳组合。我们的研究结果表明,bwa-mem 和端粒到端粒的人类基因组(CHM13-T2T)的组合在去除模拟数据中的人类 reads 方面最为有效。我们还分析了 RefSeq 中源自 CHM13-T2T 的序列,以了解 CHM13-T2T 如何减少假阳性结果。最后,我们评估了临床样本,发现 CHM13-T2T 有效地减少了宿主来源的污染,特别是在低微生物生物量样本中。本研究全面概述了 CHM13-T2T 在宏基因组分析中的应用,并强调了其在提高微生物分类准确性方面的重要性。重要性人类基因序列在宏基因组序列中占很大比例。为了获得准确和精确的微生物组信息,需要有效的宿主来源污染去除方法。比对算法和参考基因组都可能影响这一过程的有效性。端粒到端粒的人类基因组(CHM13-T2T)是一种先进的人类基因组,与常用的 GRCh38.p14 相比,有 216 Mbp 的额外新序列。我们的研究结果表明,CHM13-T2T 与 bwa-mem 软件结合在宏基因组分析中具有最佳的去宿主效果。我们还研究了 CHM13-T2T 优越性的原因。我们的研究为从宏基因组数据中去除宿主序列的最佳策略提供了见解。为未来的宏基因组分析提出了一个标准参考,这可以提高微生物鉴定的准确性。

相似文献

1
Use of the CHM13-T2T genome improves metagenomic analysis by minimizing host DNA contamination.CHM13-T2T基因组的使用通过最大限度减少宿主DNA污染改进了宏基因组分析。
mSystems. 2025 Sep 10:e0084025. doi: 10.1128/msystems.00840-25.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.
4
Precise Identification of Higher-Order Repeats (HORs) in T2T-CHM13 Assembly of Human Chromosome 21-Novel 52mer HOR and Failures of Hg38 Assembly.人类21号染色体T2T-CHM13组装中高阶重复序列(HORs)的精确鉴定——新型52聚体HOR及Hg38组装的失败
Genes (Basel). 2025 Jul 27;16(8):885. doi: 10.3390/genes16080885.
5
A T2T-CHM13 recombination map and globally diverse haplotype reference panel improves phasing and imputation.一个端粒到端粒的人类基因组参考组装(T2T-CHM13)重组图谱和全球多样化单倍型参考面板改善了定相和填充。
bioRxiv. 2025 Feb 28:2025.02.24.639687. doi: 10.1101/2025.02.24.639687.
6
Decontamination of DNA sequences from a Streptomyces genome for optimal genome mining.对链霉菌基因组中的DNA序列进行净化以实现最佳基因组挖掘。
Braz J Microbiol. 2025 Mar;56(1):79-89. doi: 10.1007/s42770-024-01598-2. Epub 2025 Jan 15.
7
Leveraging the T2T assembly to resolve rare and pathogenic inversions in reference genome gaps.利用 T2T 组装技术解决参考基因组缺口处的罕见致病性倒位。
Genome Res. 2024 Nov 20;34(11):1785-1797. doi: 10.1101/gr.279346.124.
8
Enhanced resolution of optical genome mapping utilizing telomere-to-telomere reference in genetic disorders.利用端粒到端粒参考序列提高光学基因组图谱在遗传疾病中的分辨率。
Eur J Hum Genet. 2024 Dec 9. doi: 10.1038/s41431-024-01763-z.
9
Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。
Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.
10
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.