• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CheckM:评估从分离株、单细胞和宏基因组中获得的微生物基因组质量。

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.

作者信息

Parks Donovan H, Imelfort Michael, Skennerton Connor T, Hugenholtz Philip, Tyson Gene W

机构信息

Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, QLD 4072, Queensland, Australia;

Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, QLD 4072, Queensland, Australia; Institute for Molecular Bioscience, The University of Queensland, St. Lucia, QLD 4072, Queensland, Australia;

出版信息

Genome Res. 2015 Jul;25(7):1043-55. doi: 10.1101/gr.186072.114. Epub 2015 May 14.

DOI:10.1101/gr.186072.114
PMID:25977477
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4484387/
Abstract

Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. Although this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of "marker" genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate-, single-cell-, and metagenome-derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities.

摘要

计算方法的进步以及测序成本的大幅降低,使得从分离株、单细胞和宏基因组数据中大规模恢复基因组成为可能。尽管草图基因组的覆盖范围不断扩大,为微生物生命的进化和功能多样性提供了关键信息,但完成所有可用的参考基因组已变得不切实际。从草图基因组中做出可靠的生物学推断需要准确估计其完整性和污染程度。当前评估基因组质量的方法是临时的,通常利用在所有细菌或古细菌基因组中保守的有限数量的“标记”基因。在这里,我们介绍了CheckM,这是一种自动化方法,用于使用更广泛的标记基因集来评估基因组质量,这些标记基因特定于参考基因组树中基因组的位置以及这些基因的排列信息。我们使用合成数据以及各种源自分离株、单细胞和宏基因组的基因组,证明了CheckM的有效性。结果表明,CheckM能够准确估计基因组的完整性和污染程度,并且优于现有方法。使用CheckM,我们识别出了目前影响公开可用分离株基因组的各种错误,并证明从单细胞和宏基因组数据获得的基因组在质量上存在很大差异。为了便于使用草图基因组,我们提出了一种基因组质量的客观衡量标准,可用于选择适合微生物群落特定基因和基因组分析的基因组。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c66f/4484387/56fcae9e38f7/1043f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c66f/4484387/241d4aea6a97/1043f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c66f/4484387/75b945c987e3/1043f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c66f/4484387/10b6ac2cc67a/1043f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c66f/4484387/4899f695bf26/1043f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c66f/4484387/56fcae9e38f7/1043f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c66f/4484387/241d4aea6a97/1043f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c66f/4484387/75b945c987e3/1043f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c66f/4484387/10b6ac2cc67a/1043f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c66f/4484387/4899f695bf26/1043f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c66f/4484387/56fcae9e38f7/1043f05.jpg

相似文献

1
CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.CheckM:评估从分离株、单细胞和宏基因组中获得的微生物基因组质量。
Genome Res. 2015 Jul;25(7):1043-55. doi: 10.1101/gr.186072.114. Epub 2015 May 14.
2
Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life.近 8000 个宏基因组组装基因组的恢复极大地扩展了生命之树。
Nat Microbiol. 2017 Nov;2(11):1533-1542. doi: 10.1038/s41564-017-0012-7. Epub 2017 Sep 11.
3
Recovery of strain-resolved genomes from human microbiome through an integration framework of single-cell genomics and metagenomics.通过单细胞基因组学和宏基因组学的整合框架从人类微生物组中恢复菌株解析基因组。
Microbiome. 2021 Oct 12;9(1):202. doi: 10.1186/s40168-021-01152-4.
4
Recovering complete and draft population genomes from metagenome datasets.从宏基因组数据集中恢复完整和草图的种群基因组。
Microbiome. 2016 Mar 8;4:8. doi: 10.1186/s40168-016-0154-5.
5
A machine learning-based service for estimating quality of genomes using PATRIC.基于机器学习的 PATRIC 基因组质量估算服务。
BMC Bioinformatics. 2019 Oct 3;20(1):486. doi: 10.1186/s12859-019-3068-y.
6
The Reliability of Metagenome-Assembled Genomes (MAGs) in Representing Natural Populations: Insights from Comparing MAGs against Isolate Genomes Derived from the Same Fecal Sample.宏基因组组装基因组(MAGs)在代表自然种群方面的可靠性:来自比较源自同一粪便样本的分离基因组的 MAGs 的见解。
Appl Environ Microbiol. 2021 Feb 26;87(6). doi: 10.1128/AEM.02593-20.
7
dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication.dRep:一种用于快速准确基因组比较的工具,可通过去重复从宏基因组中实现更好的基因组恢复。
ISME J. 2017 Dec;11(12):2864-2868. doi: 10.1038/ismej.2017.126. Epub 2017 Jul 25.
8
Optimizing and evaluating the reconstruction of Metagenome-assembled microbial genomes.优化和评估宏基因组组装微生物基因组的重建。
BMC Genomics. 2017 Nov 28;18(1):915. doi: 10.1186/s12864-017-4294-1.
9
MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets.MaxBin 2.0:一种从多个宏基因组数据集中恢复基因组的自动分箱算法。
Bioinformatics. 2016 Feb 15;32(4):605-7. doi: 10.1093/bioinformatics/btv638. Epub 2015 Oct 29.
10
Genome-resolved metagenomics using environmental and clinical samples.基于环境和临床样本的基因组解析宏基因组学。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab030.

引用本文的文献

1
Four novel species of the genus Aquimarina isolated from marine sponges: Aquimarina callyspongiae sp. nov., Aquimarina mycalae sp. nov., Aquimarina discodermiae sp. nov., and Aquimarina penaris sp. nov.从海洋海绵中分离出的四种新的海生水杆菌属物种:新种卡利斯海绵海生水杆菌、新种麦卡拉海生水杆菌、新种盘皮海绵海生水杆菌和新种笔管海生水杆菌。
Antonie Van Leeuwenhoek. 2025 Sep 5;118(10):150. doi: 10.1007/s10482-025-02156-y.
2
The Earth Hologenome Initiative: Data Release 1.地球全基因组计划:数据发布1
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf102.
3
Next-generation sequencing applications in food science: fundamentals and recent advances.

本文引用的文献

1
First genomic insights into members of a candidate bacterial phylum responsible for wastewater bulking.对导致废水膨胀的候选细菌门成员的首次基因组洞察。
PeerJ. 2015 Jan 27;3:e740. doi: 10.7717/peerj.740. eCollection 2015.
2
Taxator-tk: precise taxonomic assignment of metagenomes by fast approximation of evolutionary neighborhoods.Taxator-tk:通过快速近似进化邻域对宏基因组进行精确的分类学归属
Bioinformatics. 2015 Mar 15;31(6):817-24. doi: 10.1093/bioinformatics/btu745. Epub 2014 Nov 10.
3
GroopM: an automated tool for the recovery of population genomes from related metagenomes.
下一代测序技术在食品科学中的应用:基础与最新进展
Front Bioeng Biotechnol. 2025 Aug 20;13:1638957. doi: 10.3389/fbioe.2025.1638957. eCollection 2025.
4
High-Abundance Heterotrophic Bacteria Inhabit the 85° E Hydrothermal Plume of the Explosive Volcanic Zone at Gakkel Ridge, Arctic Ocean.高丰度异养细菌栖息于北冰洋加克尔海岭火山爆发区85°E热液羽流中。
Biology (Basel). 2025 Aug 12;14(8):1036. doi: 10.3390/biology14081036.
5
Host Shaping Associated Microbiota in Hydrothermal Vent Snails from the Indian Ocean Ridge.印度洋中脊热液喷口蜗牛宿主塑造相关微生物群
Biology (Basel). 2025 Jul 29;14(8):954. doi: 10.3390/biology14080954.
6
Plastispheres as reservoirs of antimicrobial resistance: Insights from metagenomic analyses across aquatic environments.作为抗菌素耐药性储存库的塑料球:来自对水生环境宏基因组分析的见解
PLoS One. 2025 Sep 3;20(9):e0330754. doi: 10.1371/journal.pone.0330754. eCollection 2025.
7
Chromosome-scale genome assembly and gene annotation of the hydrothermal vent annelid Alvinella pompejana yield insight into animal evolution in extreme environments.热液喷口环节动物庞贝蠕虫的染色体水平基因组组装和基因注释为极端环境中的动物进化提供了见解。
BMC Biol. 2025 Sep 2;23(1):274. doi: 10.1186/s12915-025-02369-7.
8
High-quality genome assembly and annotation of live animal vaccine bacteria strains in South Korea.韩国活体动物疫苗细菌菌株的高质量基因组组装与注释
BMC Genom Data. 2025 Sep 2;26(1):62. doi: 10.1186/s12863-025-01357-8.
9
Conservation of sporulation genes and a transmembrane-containing Spo0B variant in .芽孢形成基因及含跨膜结构域的Spo0B变体在……中的保守性
bioRxiv. 2025 Aug 24:2025.08.24.672004. doi: 10.1101/2025.08.24.672004.
10
The chromosomal genome sequence of the giant barrel sponge, Schmidt 1870 and its associated microbial metagenome sequences.巨型桶状海绵(施密特,1870年)的染色体基因组序列及其相关微生物宏基因组序列。
Wellcome Open Res. 2025 Jul 8;10:336. doi: 10.12688/wellcomeopenres.24173.1. eCollection 2025.
GroopM:一种从相关宏基因组中恢复种群基因组的自动化工具。
PeerJ. 2014 Sep 30;2:e603. doi: 10.7717/peerj.603. eCollection 2014.
4
An expanded genomic representation of the phylum cyanobacteria.蓝细菌门的扩展基因组图谱。
Genome Biol Evol. 2014 May;6(5):1031-45. doi: 10.1093/gbe/evu073.
5
Hundreds of circular novel plasmids and DNA elements identified in a rat cecum metamobilome.在大鼠盲肠动态可移动基因组中鉴定出数百种环状新型质粒和DNA元件。
PLoS One. 2014 Feb 4;9(2):e87924. doi: 10.1371/journal.pone.0087924. eCollection 2014.
6
PhyloSift: phylogenetic analysis of genomes and metagenomes.PhyloSift:基因组和宏基因组的系统发育分析。
PeerJ. 2014 Jan 9;2:e243. doi: 10.7717/peerj.243. eCollection 2014.
7
Pfam: the protein families database.Pfam:蛋白质家族数据库。
Nucleic Acids Res. 2014 Jan;42(Database issue):D222-30. doi: 10.1093/nar/gkt1223. Epub 2013 Nov 27.
8
Microbiology. Genomes from metagenomics.微生物学。宏基因组学的基因组。
Science. 2013 Nov 29;342(6162):1057-8. doi: 10.1126/science.1247023.
9
IMG 4 version of the integrated microbial genomes comparative analysis system.IMG 4 版整合微生物基因组比较分析系统。
Nucleic Acids Res. 2014 Jan;42(Database issue):D560-7. doi: 10.1093/nar/gkt963. Epub 2013 Oct 27.
10
PhyloPhlAn is a new method for improved phylogenetic and taxonomic placement of microbes.PhyloPhlAn 是一种用于改进微生物系统发育和分类位置的新方法。
Nat Commun. 2013;4:2304. doi: 10.1038/ncomms3304.