CoverM: read alignment statistics for metagenomics.

Authors

Aroney Samuel T N, Newell Rhys J P, Nissen Jakob N, Camargo Antonio Pedro, Tyson Gene W, Woodcroft Ben J

Affiliations

Centre for Microbiome Research, School of Biomedical Sciences, Queensland University of Technology (QUT), Translational Research Institute, Woolloongabba 4102, Australia.

The Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen 2200, Denmark.

Publication information

Bioinformatics. 2025 Mar 29;41(4). doi: 10.1093/bioinformatics/btaf147.

DOI: 10.1093/bioinformatics/btaf147
PMID: 40193404
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11993303/

Abstract

SUMMARY

Genome-centric analysis of metagenomic samples is a powerful method for understanding the function of microbial communities. Calculating read coverage is a central part of analysis, enabling differential coverage binning for recovery of genomes and estimation of microbial community composition. Coverage is determined by processing read alignments to reference sequences of either contigs or genomes. Per-reference coverage is typically calculated in an ad-hoc manner, with each software package providing its own implementation and specific definition of coverage. Here we present a unified software package CoverM which calculates several coverage statistics for contigs and genomes in an ergonomic and flexible manner. It uses "Mosdepth arrays" for computational efficiency and avoids unnecessary I/O overhead by calculating coverage statistics from streamed read alignment results.
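
The "Mosdepth arrays" mentioned above refer to the depth-calculation approach introduced by Mosdepth: each alignment adds 1 at its start position and subtracts 1 just past its end, and a single cumulative sum then yields per-base depth. The following is a minimal Python sketch of that idea, not CoverM's actual Rust implementation. It assumes 0-based, half-open `(start, end)` intervals that lie within the reference, and it ignores CIGAR details such as deletions and clipping. The function names `per_base_depth` and `coverage_stats` are hypothetical.

```python
from itertools import accumulate


def per_base_depth(ref_length, alignments):
    """Per-base depth from 0-based, half-open (start, end) alignment intervals."""
    # One extra slot so an alignment ending at ref_length can record its decrement.
    deltas = [0] * (ref_length + 1)
    for start, end in alignments:
        deltas[start] += 1  # depth rises where the alignment begins
        deltas[end] -= 1    # and falls just past where it ends
    # The running sum of the deltas is the depth at each position.
    return list(accumulate(deltas))[:ref_length]


def coverage_stats(depths):
    """Return (mean depth, fraction of positions covered by at least one read)."""
    mean = sum(depths) / len(depths)
    covered_fraction = sum(1 for d in depths if d > 0) / len(depths)
    return mean, covered_fraction


# Toy example: a 10 bp reference with three alignments.
depths = per_base_depth(10, [(0, 5), (3, 8), (3, 8)])
mean_depth, covered = coverage_stats(depths)
print(depths, mean_depth, covered)
```

Each alignment costs two array updates regardless of its length, so alignments can be consumed one at a time as they stream from an aligner, and the per-reference statistics are computed in a single pass at the end.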

AVAILABILITY AND IMPLEMENTATION

CoverM is free software available at https://github.com/wwood/coverm. CoverM is implemented in Rust, with Python (https://github.com/apcamargo/pycoverm) and Julia (https://github.com/JuliaBinaryWrappers/CoverM_jll.jl) interfaces.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a96c/11993303/9f5b20f8c89c/btaf147f1.jpg
