• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于基因组的肠杆菌目种快速划分。

Fast genome-based delimitation of Enterobacterales species.

机构信息

Department of Biology, Wilfrid Laurier University, Waterloo, ON, Canada.

出版信息

PLoS One. 2023 Sep 14;18(9):e0291492. doi: 10.1371/journal.pone.0291492. eCollection 2023.

DOI:10.1371/journal.pone.0291492
PMID:37708115
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10501659/
Abstract

Average Nucleotide Identity (ANI) is becoming a standard measure for bacterial species delimitation. However, its calculation can take orders of magnitude longer than similarity estimates based on sampling of short nucleotides, compiled into so-called sketches. These estimates are widely used. However, their variable correlation with ANI has suggested that they might not be as accurate. For a where-the-rubber-meets-the-road assessment, we compared two sketching programs, mash and dashing, against ANI, in delimiting species among Esterobacterales genomes. Receiver Operating Characteristic (ROC) analysis found Area Under the Curve (AUC) values of 0.99, almost perfect species discrimination for all three measures. Subsampling to avoid over-represented species reduced these AUC values to 0.92, still highly accurate. Focused tests with ten genera, each represented by more than three species, also showed almost identical results for all methods. Shigella showed the lowest AUC values (0.68), followed by Citrobacter (0.80). All other genera, Dickeya, Enterobacter, Escherichia, Klebsiella, Pectobacterium, Proteus, Providencia and Yersinia, produced AUC values above 0.90. The species delimitation thresholds varied, with species distance ranges in a few genera overlapping the genus ranges of other genera. Mash was able to separate the E. coli + Shigella complex into 25 apparent phylogroups, four of them corresponding, roughly, to the four Shigella species represented in the data. Our results suggest that fast estimates of genome similarity are as good as ANI for species delimitation. Therefore, these estimates might suffice for covering the role of genomic similarity in bacterial taxonomy, and should increase confidence in their use for efficient bacterial identification and clustering, from epidemiological to genome-based detection of potential contaminants in farming and industry settings.

摘要

平均核苷酸同一性 (ANI) 正成为细菌物种划分的标准衡量标准。然而,它的计算时间可能比基于短核苷酸采样的相似性估计长得多,这些估计被汇编成所谓的草图。这些估计被广泛使用。然而,它们与 ANI 的可变相关性表明,它们可能并不那么准确。为了进行实地评估,我们比较了两种草图程序 mash 和 dashing 与 ANI 之间的差异,以确定 Esterobacterales 基因组中的物种界限。接收者操作特征 (ROC) 分析发现,所有三种方法的曲线下面积 (AUC) 值均为 0.99,几乎完美地实现了物种区分。通过避免过度代表物种的抽样,这些 AUC 值降低到 0.92,但仍然非常准确。对十个属进行的重点测试,每个属由三个以上的物种代表,所有方法也显示出几乎相同的结果。志贺氏菌的 AUC 值最低 (0.68),其次是柠檬酸杆菌 (0.80)。所有其他属,如狄克氏菌、肠杆菌、大肠杆菌、克雷伯氏菌、果胶杆菌、变形杆菌、普罗维登斯菌和耶尔森氏菌,其 AUC 值均高于 0.90。物种划分阈值有所不同,一些属的物种距离范围与其他属的属范围重叠。mash 能够将大肠杆菌+志贺氏菌复合体分为 25 个明显的系统发育群,其中四个大致对应于数据中代表的四个志贺氏菌种。我们的结果表明,快速估计基因组相似性与 ANI 一样适用于物种划分。因此,这些估计可能足以涵盖基因组相似性在细菌分类学中的作用,并且应该增加对其在细菌识别和聚类中的使用的信心,从流行病学到基于基因组的对农业和工业环境中潜在污染物的检测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a02b/10501659/de444956294c/pone.0291492.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a02b/10501659/543fdc2c94d2/pone.0291492.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a02b/10501659/49bf8a399121/pone.0291492.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a02b/10501659/de444956294c/pone.0291492.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a02b/10501659/543fdc2c94d2/pone.0291492.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a02b/10501659/49bf8a399121/pone.0291492.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a02b/10501659/de444956294c/pone.0291492.g003.jpg

相似文献

1
Fast genome-based delimitation of Enterobacterales species.基于基因组的肠杆菌目种快速划分。
PLoS One. 2023 Sep 14;18(9):e0291492. doi: 10.1371/journal.pone.0291492. eCollection 2023.
2
FastANI, Mash and Dashing equally differentiate between species.FastANI、Mash 和 Dashing 均可区分物种。
PeerJ. 2022 Jul 21;10:e13784. doi: 10.7717/peerj.13784. eCollection 2022.
3
A re-evaluation of the taxonomy of phytopathogenic genera Dickeya and Pectobacterium using whole-genome sequencing data.利用全基因组测序数据对植物致病属Dickeya和果胶杆菌属的分类学进行重新评估。
Syst Appl Microbiol. 2016 Jun;39(4):252-259. doi: 10.1016/j.syapm.2016.04.001. Epub 2016 Apr 20.
4
Conserved signature indels and signature proteins as novel tools for understanding microbial phylogeny and systematics: identification of molecular signatures that are specific for the phytopathogenic genera Dickeya, Pectobacterium and Brenneria.保守的特征缺失和特征蛋白作为理解微生物系统发育和系统分类的新工具:鉴定出对植物病原菌 Dickeya、果胶杆菌和韧皮部杆菌属特异的分子特征。
Int J Syst Evol Microbiol. 2014 Feb;64(Pt 2):366-383. doi: 10.1099/ijs.0.054213-0.
5
A genomic distance based on MUM indicates discontinuity between most bacterial species and genera.基于最大唯一匹配(MUM)的基因组距离表明,大多数细菌物种和属之间存在间断性。
J Bacteriol. 2009 Jan;191(1):91-9. doi: 10.1128/JB.01202-08. Epub 2008 Oct 31.
6
Transcriptome and Comparative Genomics Analyses Reveal New Functional Insights on Key Determinants of Pathogenesis and Interbacterial Competition in and spp.转录组和比较基因组学分析揭示了 和 种属中关键致病和种间竞争决定因素的新功能见解。
Appl Environ Microbiol. 2019 Jan 9;85(2). doi: 10.1128/AEM.02050-18. Print 2019 Jan 15.
7
BMScan: using whole genome similarity to rapidly and accurately identify bacterial meningitis causing species.BMScan:利用全基因组相似度快速准确地鉴定细菌性脑膜炎致病菌种。
BMC Infect Dis. 2018 Aug 15;18(1):405. doi: 10.1186/s12879-018-3324-1.
8
Diversity within the complex, identification of and members, proposal of the novel species sp. nov.种内多样性的复杂性,鉴定成员,提出新物种 sp. nov.
Int J Syst Evol Microbiol. 2021 Nov;71(11). doi: 10.1099/ijsem.0.005059.
9
Genome sequence-based criteria for demarcation and definition of species in the genus .基于基因组序列的. 属内种的划分和定义标准
Int J Syst Evol Microbiol. 2020 Mar;70(3):1738-1750. doi: 10.1099/ijsem.0.003963.
10
LINflow: a computational pipeline that combines an alignment-free with an alignment-based method to accelerate generation of similarity matrices for prokaryotic genomes.LINflow:一种计算流程,它将一种无比对方法与一种基于比对的方法相结合,以加速原核生物基因组相似性矩阵的生成。
PeerJ. 2021 Mar 24;9:e10906. doi: 10.7717/peerj.10906. eCollection 2021.

引用本文的文献

1
HyperGen: Compact and Efficient Genome Sketching using Hyperdimensional Vectors.HyperGen:使用超维向量进行紧凑且高效的基因组草图绘制
Bioinformatics. 2024 Jul 16;40(7). doi: 10.1093/bioinformatics/btae452.

本文引用的文献

1
Escherichia Coli: What Is and Which Are?大肠杆菌:是什么和有哪些?
Mol Biol Evol. 2023 Jan 4;40(1). doi: 10.1093/molbev/msac273.
2
Introducing the Bacterial and Viral Bioinformatics Resource Center (BV-BRC): a resource combining PATRIC, IRD and ViPR.推出细菌和病毒生物信息学资源中心(BV-BRC):一个整合 PATRIC、IRD 和 ViPR 的资源。
Nucleic Acids Res. 2023 Jan 6;51(D1):D678-D689. doi: 10.1093/nar/gkac1003.
3
SeqCode: a nomenclatural code for prokaryotes described from sequence data.序列码:一种基于序列数据描述的原核生物命名代码。
Nat Microbiol. 2022 Oct;7(10):1702-1708. doi: 10.1038/s41564-022-01214-9. Epub 2022 Sep 19.
4
FastANI, Mash and Dashing equally differentiate between species.FastANI、Mash 和 Dashing 均可区分物种。
PeerJ. 2022 Jul 21;10:e13784. doi: 10.7717/peerj.13784. eCollection 2022.
5
GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy.GTDB:通过系统发生一致、等级归一化和基于完整基因组的分类学,对细菌和古菌多样性进行持续普查。
Nucleic Acids Res. 2022 Jan 7;50(D1):D785-D794. doi: 10.1093/nar/gkab776.
6
Re-evaluating the evidence for a universal genetic boundary among microbial species.重新评估微生物物种间普遍存在的遗传界限的证据。
Nat Commun. 2021 Jul 7;12(1):4059. doi: 10.1038/s41467-021-24128-2.
7
Reply to: "Re-evaluating the evidence for a universal genetic boundary among microbial species".回复:“重新评估微生物物种间普遍遗传界限的证据”
Nat Commun. 2021 Jul 7;12(1):4060. doi: 10.1038/s41467-021-24129-1.
8
A standardized archaeal taxonomy for the Genome Taxonomy Database.基于基因组分类数据库的标准化古菌分类学。
Nat Microbiol. 2021 Jul;6(7):946-959. doi: 10.1038/s41564-021-00918-8. Epub 2021 Jun 21.
9
ggtreeExtra: Compact Visualization of Richly Annotated Phylogenetic Data.ggtreeExtra:丰富注释的系统发育数据的紧凑可视化。
Mol Biol Evol. 2021 Aug 23;38(9):4039-4042. doi: 10.1093/molbev/msab166.
10
Genotypic Characterization of Clinical spp. Isolates Collected From Patients With Suspected Community-Onset Sepsis, Sweden.从瑞典疑似社区获得性脓毒症患者中分离出的临床菌株的基因型特征分析
Front Microbiol. 2021 Apr 30;12:640408. doi: 10.3389/fmicb.2021.640408. eCollection 2021.