• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过靶向 IIB 型限制位点去除宏基因组分类分析中的假阳性。

Removal of false positives in metagenomics-based taxonomy profiling via targeting Type IIB restriction sites.

机构信息

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.

Qingdao OE Biotechnology Company Limited, Qingdao, Shandong, China.

出版信息

Nat Commun. 2023 Sep 1;14(1):5321. doi: 10.1038/s41467-023-41099-8.

DOI:10.1038/s41467-023-41099-8
PMID:37658057
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10474111/
Abstract

Accurate species identification and abundance estimation are critical for the interpretation of whole metagenome sequencing (WMS) data. Yet, existing metagenomic profilers suffer from false-positive identifications, which can account for more than 90% of total identified species. Here, by leveraging species-specific Type IIB restriction endonuclease digestion sites as reference instead of universal markers or whole microbial genomes, we present a metagenomic profiler, MAP2B (MetAgenomic Profiler based on type IIB restriction sites), to resolve those issues. We first illustrate the pitfalls of using relative abundance as the only feature in determining false positives. We then propose a feature set to distinguish false positives from true positives, and using simulated metagenomes from CAMI2, we establish a false-positive recognition model. By benchmarking the performance in metagenomic profiling using a simulation dataset with varying sequencing depth and species richness, we illustrate the superior performance of MAP2B over existing metagenomic profilers in species identification. We further test the performance of MAP2B using real WMS data from an ATCC mock community, confirming its superior precision against sequencing depth. Finally, by leveraging WMS data from an IBD cohort, we demonstrate the taxonomic features generated by MAP2B can better discriminate IBD and predict metabolomic profiles.

摘要

准确的物种鉴定和丰度估计对于解释全宏基因组测序(WMS)数据至关重要。然而,现有的宏基因组分析器存在假阳性鉴定问题,这些问题可能占总鉴定物种的 90%以上。在这里,我们利用物种特异性的 Type IIB 限制内切酶消化位点作为参考,而不是通用标记或整个微生物基因组,提出了一种宏基因组分析器 MAP2B(基于 Type IIB 限制位点的宏基因组分析器),以解决这些问题。我们首先说明了仅使用相对丰度作为确定假阳性的唯一特征所存在的问题。然后,我们提出了一个特征集来区分假阳性和真阳性,并使用来自 CAMI2 的模拟宏基因组建立了假阳性识别模型。通过使用具有不同测序深度和物种丰富度的模拟数据集来评估宏基因组分析的性能,我们说明了 MAP2B 在物种鉴定方面优于现有宏基因组分析器的性能。我们进一步使用来自 ATCC 模拟群落的真实 WMS 数据测试了 MAP2B 的性能,证实了其在对抗测序深度方面的卓越精度。最后,通过利用来自 IBD 队列的 WMS 数据,我们证明了 MAP2B 生成的分类特征可以更好地区分 IBD 并预测代谢组学特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9fc6/10474111/87d8950ad45b/41467_2023_41099_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9fc6/10474111/87d8950ad45b/41467_2023_41099_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9fc6/10474111/87d8950ad45b/41467_2023_41099_Fig1_HTML.jpg

相似文献

1
Removal of false positives in metagenomics-based taxonomy profiling via targeting Type IIB restriction sites.通过靶向 IIB 型限制位点去除宏基因组分类分析中的假阳性。
Nat Commun. 2023 Sep 1;14(1):5321. doi: 10.1038/s41467-023-41099-8.
2
Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets.评价长读 shotgun 宏基因组测序数据集的分类和分析方法。
BMC Bioinformatics. 2022 Dec 13;23(1):541. doi: 10.1186/s12859-022-05103-0.
3
Challenges in benchmarking metagenomic profilers.宏基因组分析工具的基准测试挑战。
Nat Methods. 2021 Jun;18(6):618-626. doi: 10.1038/s41592-021-01141-3. Epub 2021 May 13.
4
TAMPA: interpretable analysis and visualization of metagenomics-based taxon abundance profiles.坦帕:基于宏基因组的分类丰度图谱的可解释分析和可视化。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad008. Epub 2023 Feb 28.
5
Benchmarking Metagenomic Classifiers on Simulated Ancient and Modern Metagenomic Data.基于模拟的古代和现代宏基因组数据对宏基因组分类器进行基准测试。
Microorganisms. 2023 Oct 2;11(10):2478. doi: 10.3390/microorganisms11102478.
6
Crowdsourced benchmarking of taxonomic metagenome profilers: lessons learned from the sbv IMPROVER Microbiomics challenge.众包分类学宏基因组分析器的基准测试:从 sbv IMPROVER 微生物组学挑战赛中吸取的经验教训。
BMC Genomics. 2022 Aug 30;23(1):624. doi: 10.1186/s12864-022-08803-2.
7
Cultivation-independent genomes greatly expand taxonomic-profiling capabilities of mOTUs across various environments.非培养基因组极大地扩展了 mOTU 在各种环境中的分类鉴定能力。
Microbiome. 2022 Dec 5;10(1):212. doi: 10.1186/s40168-022-01410-z.
8
Comprehensive benchmarking and ensemble approaches for metagenomic classifiers.元基因组分类器的综合基准测试和集成方法。
Genome Biol. 2017 Sep 21;18(1):182. doi: 10.1186/s13059-017-1299-7.
9
CAIM: coverage-based analysis for identification of microbiome.CAIM:基于覆盖度的微生物组分析方法。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae424.
10
Gauge your phage: benchmarking of bacteriophage identification tools in metagenomic sequencing data.评估噬菌体:宏基因组测序数据中噬菌体鉴定工具的基准测试。
Microbiome. 2023 Apr 21;11(1):84. doi: 10.1186/s40168-023-01533-x.

引用本文的文献

1
Exploring extreme environments in Türkiye for novel P450s through metagenomic analysis.通过宏基因组分析在土耳其探索极端环境以寻找新型细胞色素P450。
PLoS One. 2025 Sep 8;20(9):e0330523. doi: 10.1371/journal.pone.0330523. eCollection 2025.
2
Challenges and Opportunities in Analyzing Cancer-Associated Microbiomes.分析癌症相关微生物群的挑战与机遇
Cancer Res. 2025 Aug 12. doi: 10.1158/0008-5472.CAN-24-3629.
3
2bRAD-M reveals circulating microbiome in chronic antibody-mediated rejection (CAMR) and IgA nephropathy (IgAN) after kidney transplantation.

本文引用的文献

1
Predicting metabolomic profiles from microbial composition through neural ordinary differential equations.通过神经常微分方程从微生物组成预测代谢组学图谱。
Nat Mach Intell. 2023 Mar;5(3):284-293. doi: 10.1038/s42256-023-00627-3. Epub 2023 Mar 13.
2
Extending and improving metagenomic taxonomic profiling with uncharacterized species using MetaPhlAn 4.利用 MetaPhlAn 4 对未鉴定物种进行宏基因组分类分析的扩展和改进。
Nat Biotechnol. 2023 Nov;41(11):1633-1644. doi: 10.1038/s41587-023-01688-w. Epub 2023 Feb 23.
3
Cultivation-independent genomes greatly expand taxonomic-profiling capabilities of mOTUs across various environments.
2bRAD-M揭示了肾移植后慢性抗体介导性排斥反应(CAMR)和IgA肾病(IgAN)中的循环微生物群。
BMC Microbiol. 2025 Jul 31;25(1):468. doi: 10.1186/s12866-025-04103-3.
4
Comprehensive taxonomic identification of microbial species in metagenomic data using SingleM and Sandpiper.使用SingleM和矶鹬对宏基因组数据中的微生物物种进行全面的分类鉴定。
Nat Biotechnol. 2025 Jul 16. doi: 10.1038/s41587-025-02738-1.
5
2bRAD-M Reveals the Characteristics of Urinary Microbiota in Overweight Patients with Urinary Tract Stones.2bRAD-M技术揭示超重尿路结石患者的尿液微生物群特征。
Biomedicines. 2025 May 14;13(5):1197. doi: 10.3390/biomedicines13051197.
6
Urobiome of patients with diabetic kidney disease in different stages is revealed by 2bRAD-M.2bRAD-M技术揭示了不同阶段糖尿病肾病患者的泌尿微生物组。
J Transl Med. 2025 Apr 10;23(1):414. doi: 10.1186/s12967-025-06405-6.
7
2bRAD-M reveals differences in microbial communities between Modic changes and disc herniation.2bRAD-M技术揭示了Modic改变与椎间盘突出之间微生物群落的差异。
Front Cell Infect Microbiol. 2025 Mar 21;15:1449873. doi: 10.3389/fcimb.2025.1449873. eCollection 2025.
8
Circulating microbiome DNA features and its effect on predicting clinicopathological characteristics of patients with colorectal cancer.循环微生物组DNA特征及其对预测结直肠癌患者临床病理特征的影响。
J Transl Med. 2025 Feb 13;23(1):178. doi: 10.1186/s12967-025-06164-4.
9
RFW captures species-level metagenomic functions by integrating genome annotation information.RFW通过整合基因组注释信息来捕获物种水平的宏基因组功能。
Cell Rep Methods. 2024 Dec 16;4(12):100932. doi: 10.1016/j.crmeth.2024.100932. Epub 2024 Dec 10.
10
MetaAll: integrative bioinformatics workflow for analysing clinical metagenomic data.MetaAll:用于分析临床宏基因组数据的综合生物信息学工作流程。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae597.
非培养基因组极大地扩展了 mOTU 在各种环境中的分类鉴定能力。
Microbiome. 2022 Dec 5;10(1):212. doi: 10.1186/s40168-022-01410-z.
4
Pan-cancer analyses reveal cancer-type-specific fungal ecologies and bacteriome interactions.泛癌分析揭示了癌症类型特异性真菌生态和细菌群落相互作用。
Cell. 2022 Sep 29;185(20):3789-3806.e17. doi: 10.1016/j.cell.2022.09.005.
5
Zebra: Static and Dynamic Genome Cover Thresholds with Overlapping References.斑马:具有重叠参考的静态和动态基因组覆盖阈值。
mSystems. 2022 Oct 26;7(5):e0075822. doi: 10.1128/msystems.00758-22. Epub 2022 Sep 8.
6
Critical Assessment of Metagenome Interpretation: the second round of challenges.宏基因组解读的关键评估:第二轮挑战。
Nat Methods. 2022 Apr;19(4):429-440. doi: 10.1038/s41592-022-01431-4. Epub 2022 Apr 8.
7
Species-resolved sequencing of low-biomass or degraded microbiomes using 2bRAD-M.使用 2bRAD-M 对低生物量或降解微生物组进行种分辨测序。
Genome Biol. 2022 Jan 26;23(1):36. doi: 10.1186/s13059-021-02576-9.
8
Ensembl Genomes 2022: an expanding genome resource for non-vertebrates.Ensembl Genomes 2022:一个不断扩展的非脊椎动物基因组资源。
Nucleic Acids Res. 2022 Jan 7;50(D1):D996-D1003. doi: 10.1093/nar/gkab1007.
9
DNA Extraction and Host Depletion Methods Significantly Impact and Potentially Bias Bacterial Detection in a Biological Fluid.DNA提取和宿主去除方法对生物体液中的细菌检测有显著影响,并可能产生偏差。
mSystems. 2021 Jun 29;6(3):e0061921. doi: 10.1128/mSystems.00619-21. Epub 2021 Jun 15.
10
Clinical Evaluation of an Improved Metagenomic Next-Generation Sequencing Test for the Diagnosis of Bloodstream Infections.一种改进的宏基因组下一代测序检测用于血流感染诊断的临床评估
Clin Chem. 2021 Aug 5;67(8):1133-1143. doi: 10.1093/clinchem/hvab061.