• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CAIM:基于覆盖度的微生物组分析方法。

CAIM: coverage-based analysis for identification of microbiome.

机构信息

Department of Biomedical Informatics, University of Arkansas for Medical Sciences, 4301 W Markham St, Little Rock, AR 72205, United States.

Stowers Institute for Medical Research, 1000 E 50 St, Kansas City, MO 64110, United States.

出版信息

Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae424.

DOI:10.1093/bib/bbae424
PMID:39222062
原文链接:
https://pmc.ncbi.nlm.nih.gov/articles/PMC11367759/
Abstract

Accurate taxonomic profiling of microbial taxa in a metagenomic sample is vital to gain insights into microbial ecology. Recent advancements in sequencing technologies have contributed tremendously toward understanding these microbes at species resolution through a whole shotgun metagenomic approach. In this study, we developed a new bioinformatics tool, coverage-based analysis for identification of microbiome (CAIM), for accurate taxonomic classification and quantification within both long- and short-read metagenomic samples using an alignment-based method. CAIM depends on two different containment techniques to identify species in metagenomic samples using their genome coverage information to filter out false positives rather than the traditional approach of relative abundance. In addition, we propose a nucleotide-count-based abundance estimation, which yield lesser root mean square error than the traditional read-count approach. We evaluated the performance of CAIM on 28 metagenomic mock communities and 2 synthetic datasets by comparing it with other top-performing tools. CAIM maintained a consistently good performance across datasets in identifying microbial taxa and in estimating relative abundances than other tools. CAIM was then applied to a real dataset sequenced on both Nanopore (with and without amplification) and Illumina sequencing platforms and found high similarity of taxonomic profiles between the sequencing platforms. Lastly, CAIM was applied to fecal shotgun metagenomic datasets of 232 colorectal cancer patients and 229 controls obtained from 4 different countries and 44 primary liver cancer patients and 76 controls. The predictive performance of models using the genome-coverage cutoff was better than those using the relative-abundance cutoffs in discriminating colorectal cancer and primary liver cancer patients from healthy controls with a highly confident species markers.

摘要

准确地对宏基因组样本中的微生物分类群进行分类对于深入了解微生物生态学至关重要。最近测序技术的进步通过全基因组 shotgun 宏基因组方法极大地促进了对这些微生物在物种分辨率水平上的理解。在这项研究中,我们开发了一种新的生物信息学工具,基于覆盖度的微生物组分析(CAIM),用于使用基于比对的方法对长读长和短读长宏基因组样本进行准确的分类和定量。CAIM 依赖于两种不同的包含技术,使用其基因组覆盖度信息来识别宏基因组样本中的物种,从而过滤掉假阳性,而不是传统的相对丰度方法。此外,我们提出了一种基于核苷酸计数的丰度估计方法,其均方根误差小于传统的读计数方法。我们通过将 CAIM 与其他表现最佳的工具进行比较,在 28 个宏基因组模拟群落和 2 个合成数据集上评估了其性能。CAIM 在识别微生物分类群和估计相对丰度方面的表现始终优于其他工具。然后,我们将 CAIM 应用于在 Nanopore (扩增和不扩增)和 Illumina 测序平台上测序的真实数据集,发现测序平台之间的分类群谱具有高度相似性。最后,CAIM 应用于来自 4 个不同国家的 232 名结直肠癌患者和 229 名对照者以及 44 名原发性肝癌患者和 76 名对照者的粪便 shotgun 宏基因组数据集。使用基因组覆盖度截止值的模型的预测性能优于使用相对丰度截止值的模型,在区分结直肠癌和原发性肝癌患者与健康对照者时,具有高度置信度的物种标志物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf1/11367759/78a69457304b/bbae424f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf1/11367759/14aa38fcaba0/bbae424f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf1/11367759/de9ec0786d9b/bbae424f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf1/11367759/22c3c23a31c7/bbae424f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf1/11367759/975fb695bd85/bbae424f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf1/11367759/78a69457304b/bbae424f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf1/11367759/14aa38fcaba0/bbae424f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf1/11367759/de9ec0786d9b/bbae424f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf1/11367759/22c3c23a31c7/bbae424f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf1/11367759/975fb695bd85/bbae424f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf1/11367759/78a69457304b/bbae424f5.jpg

相似文献

1
CAIM: coverage-based analysis for identification of microbiome.CAIM:基于覆盖度的微生物组分析方法。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae424.
2
CAIM: Coverage-based Analysis for Identification of Microbiome.CAIM:基于覆盖度的微生物组识别分析
bioRxiv. 2024 May 15:2024.04.25.591018. doi: 10.1101/2024.04.25.591018.
3
Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets.评价长读 shotgun 宏基因组测序数据集的分类和分析方法。
BMC Bioinformatics. 2022 Dec 13;23(1):541. doi: 10.1186/s12859-022-05103-0.
4
Species classifier choice is a key consideration when analysing low-complexity food microbiome data.在分析低复杂度食品微生物组数据时,物种分类器的选择是一个关键考虑因素。
Microbiome. 2018 Mar 20;6(1):50. doi: 10.1186/s40168-018-0437-0.
5
Shotgun metagenome data of a defined mock community using Oxford Nanopore, PacBio and Illumina technologies.使用 Oxford Nanopore、PacBio 和 Illumina 技术对一个定义的模拟群落进行的 shotgun 宏基因组数据。
Sci Data. 2019 Nov 26;6(1):285. doi: 10.1038/s41597-019-0287-z.
6
Benchmarking genome assembly methods on metagenomic sequencing data.基于宏基因组测序数据对基因组组装方法进行基准测试。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad087.
7
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.环境宏基因组的MinION™纳米孔测序:一种合成方法。
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.
8
Practical considerations for sampling and data analysis in contemporary metagenomics-based environmental studies.当代基于宏基因组学的环境研究中采样与数据分析的实际考量
J Microbiol Methods. 2018 Nov;154:14-18. doi: 10.1016/j.mimet.2018.09.020. Epub 2018 Oct 1.
9
Microbial community analysis using high-throughput sequencing technology: a beginner's guide for microbiologists.使用高通量测序技术进行微生物群落分析:微生物学家的入门指南。
J Microbiol. 2020 Mar;58(3):176-192. doi: 10.1007/s12275-020-9525-5. Epub 2020 Feb 27.
10
Estimating the total genome length of a metagenomic sample using k-mers.利用 k- -mer 估算宏基因组样本的总基因组长度。
BMC Genomics. 2019 Apr 4;20(Suppl 2):183. doi: 10.1186/s12864-019-5467-x.

引用本文的文献

1
Precise and scalable metagenomic profiling with sample-tailored minimizer libraries.使用样本定制的最小化子库进行精确且可扩展的宏基因组分析。
NAR Genom Bioinform. 2025 Jun 9;7(2):lqaf076. doi: 10.1093/nargab/lqaf076. eCollection 2025 Jun.

本文引用的文献

1
Gut dysbiosis in Thai intrahepatic cholangiocarcinoma and hepatocellular carcinoma.泰国肝内胆管癌和肝细胞癌的肠道菌群失调。
Sci Rep. 2023 Jul 14;13(1):11406. doi: 10.1038/s41598-023-38307-2.
2
KMCP: accurate metagenomic profiling of both prokaryotic and viral populations by pseudo-mapping.KMCP:通过伪映射对原核生物和病毒种群进行准确的宏基因组分析。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac845.
3
Performance of Nanopore and Illumina Metagenomic Sequencing for Pathogen Detection and Transcriptome Analysis in Infantile Central Nervous System Infections.
纳米孔测序和Illumina宏基因组测序在婴幼儿中枢神经系统感染病原体检测及转录组分析中的性能
Open Forum Infect Dis. 2022 Sep 30;9(10):ofac504. doi: 10.1093/ofid/ofac504. eCollection 2022 Oct.
4
Increasing prediction performance of colorectal cancer disease status using random forests classification based on metagenomic shotgun sequencing data.基于宏基因组鸟枪法测序数据,利用随机森林分类法提高结直肠癌疾病状态的预测性能。
Synth Syst Biotechnol. 2022 Jan 27;7(1):574-585. doi: 10.1016/j.synbio.2022.01.005. eCollection 2022 Mar.
5
Metagenomic profiling of host-associated bacteria from 8 datasets of the red alga Porphyra purpurea with MetaPhlAn3.运用 MetaPhlAn3 对紫菜(Porphyra purpurea)8 个数据集的宿主相关细菌进行宏基因组分析。
Mar Genomics. 2021 Oct;59:100866. doi: 10.1016/j.margen.2021.100866. Epub 2021 Apr 1.
6
Bacterial community structure alterations within the colorectal cancer gut microbiome.结直肠癌肠道微生物组中细菌群落结构的改变。
BMC Microbiol. 2021 Mar 31;21(1):98. doi: 10.1186/s12866-021-02153-x.
7
Kssd: sequence dimensionality reduction by k-mer substring space sampling enables real-time large-scale datasets analysis.Kssd:通过 K-mer 子串空间采样进行序列降维,实现实时大规模数据集分析。
Genome Biol. 2021 Mar 16;22(1):84. doi: 10.1186/s13059-021-02303-4.
8
KITSUNE: A Tool for Identifying Empirically Optimal K-mer Length for Alignment-Free Phylogenomic Analysis.KITSUNE:一种用于为无比对系统发育基因组分析确定经验最优k-mer长度的工具。
Front Bioeng Biotechnol. 2020 Sep 23;8:556413. doi: 10.3389/fbioe.2020.556413. eCollection 2020.
9
Computational Methods for Strain-Level Microbial Detection in Colony and Metagenome Sequencing Data.菌落和宏基因组测序数据中菌株水平微生物检测的计算方法
Front Microbiol. 2020 Aug 18;11:1925. doi: 10.3389/fmicb.2020.01925. eCollection 2020.
10
Metalign: efficient alignment-based metagenomic profiling via containment min hash.Metalign:基于包含最小哈希的高效基于比对的宏基因组分析。
Genome Biol. 2020 Sep 10;21(1):242. doi: 10.1186/s13059-020-02159-0.