• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对细菌双链DNA病毒基因图谱的探索揭示了广泛镶嵌现象中的ANI差距。

Exploration of the genetic landscape of bacterial dsDNA viruses reveals an ANI gap amid extensive mosaicism.

作者信息

Ndovie Wanangwa, Havránek Jan, Leconte Jade, Koszucki Janusz, Chindelevitch Leonid, Adriaenssens Evelien M, Mostowy Rafal J

机构信息

Malopolska Centre of Biotechnology, Jagiellonian University, Kraków, Poland.

Doctoral School of Exact and Natural Sciences, Jagiellonian University, Kraków, Poland.

出版信息

mSystems. 2025 Feb 18;10(2):e0166124. doi: 10.1128/msystems.01661-24. Epub 2025 Jan 29.

DOI:10.1128/msystems.01661-24
PMID:39878503
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11834439/
Abstract

Average nucleotide identity (ANI) is a widely used metric to estimate genetic relatedness, especially in microbial species delineation. While ANI calculation has been well optimized for bacteria and closely related viral genomes, accurate estimation of ANI below 80%, particularly in large reference data sets, has been challenging due to a lack of accurate and scalable methods. To bridge this gap, we introduce MANIAC, an efficient computational pipeline optimized for estimating ANI and alignment fraction (AF) in viral genomes with divergence around ANI of 70%. Using a rigorous simulation framework, we demonstrate MANIAC's accuracy and scalability compared to existing approaches, even to data sets of hundreds of thousands of viral genomes. Applying MANIAC to a curated data set of complete bacterial dsDNA viruses revealed a multimodal ANI distribution, with a distinct gap around 80%, akin to the bacterial ANI gap (~90%) but shifted, likely due to viral-specific evolutionary processes such as recombination dynamics and mosaicism. We then evaluated ANI and AF as predictors of genus-level taxonomy using a logistic regression model. We found that this model has strong predictive power (PR-AUC = 0.981), but that it works much better for virulent (PR-AUC = 0.997) than temperate (PR-AUC = 0.847) bacterial viruses. This highlights the complexity of taxonomic classification in temperate phages, known for their extensive mosaicism, and cautions against over-reliance on ANI in such cases. MANIAC can be accessed at https://github.com/bioinf-mcb/MANIAC.IMPORTANCEWe introduce a novel computational pipeline called MANIAC, designed to accurately assess average nucleotide identity (ANI) and alignment fraction (AF) between diverse viral genomes, scalable to data sets of over 100k genomes. Using computer simulations and real data analyses, we show that MANIAC could accurately estimate genetic relatedness between pairs of viral genomes of around 60%-70% ANI. We applied MANIAC to investigate the question of ANI discontinuity in bacterial dsDNA viruses, finding evidence for an ANI gap, akin to the one seen in bacteria but around ANI of 80%. We then assessed the ability of ANI and AF to predict taxonomic genus boundaries, finding its strong predictive power in virulent, but not in temperate phages. Our results suggest that bacterial dsDNA viruses may exhibit an ANI threshold (on average around 80%) above which recombination helps maintain population cohesiveness, as previously argued in bacteria.

摘要

平均核苷酸同一性(ANI)是一种广泛用于估计遗传相关性的指标,尤其在微生物物种划分中。虽然ANI计算已针对细菌和密切相关的病毒基因组进行了很好的优化,但由于缺乏准确且可扩展的方法,准确估计低于80%的ANI,特别是在大型参考数据集中,一直具有挑战性。为了弥补这一差距,我们引入了MANIAC,这是一种高效的计算流程,针对估计ANI约为70%的病毒基因组中的ANI和比对分数(AF)进行了优化。使用严格的模拟框架,我们展示了MANIAC与现有方法相比的准确性和可扩展性,甚至对于数十万个病毒基因组的数据集也是如此。将MANIAC应用于精心策划的完整细菌双链DNA病毒数据集,揭示了一种多峰ANI分布,在80%左右有明显的差距,类似于细菌的ANI差距(约90%),但有所偏移,这可能是由于病毒特异性的进化过程,如重组动态和镶嵌性。然后,我们使用逻辑回归模型评估ANI和AF作为属级分类学预测指标的能力。我们发现该模型具有很强的预测能力(PR-AUC = 0.981),但对于烈性细菌病毒(PR-AUC = 0.997)的效果比对温和细菌病毒(PR-AUC = 0.847)好得多。这突出了温和噬菌体分类学分类的复杂性,温和噬菌体以其广泛的镶嵌性而闻名,并警示在这种情况下不要过度依赖ANI。可在https://github.com/bioinf-mcb/MANIAC访问MANIAC。

重要性

我们引入了一种名为MANIAC的新型计算流程,旨在准确评估不同病毒基因组之间的平均核苷酸同一性(ANI)和比对分数(AF),可扩展到超过10万个基因组的数据集。通过计算机模拟和实际数据分析,我们表明MANIAC可以准确估计ANI约为60%-70%的病毒基因组对之间的遗传相关性。我们应用MANIAC来研究细菌双链DNA病毒中ANI不连续性的问题,发现了一个ANI差距的证据,类似于在细菌中看到的,但在ANI约为80%左右。然后,我们评估了ANI和AF预测分类属边界的能力,发现其在烈性噬菌体中有很强的预测能力,但在温和噬菌体中则不然。我们的结果表明,细菌双链DNA病毒可能表现出一个ANI阈值(平均约为80%),高于该阈值重组有助于维持种群凝聚力,正如之前在细菌中所论证的那样。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/fb9c4fe308d0/msystems.01661-24.f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/c57aa3e9504e/msystems.01661-24.f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/a2f918f3e08b/msystems.01661-24.f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/8b35dd5243de/msystems.01661-24.f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/1f8e54d99059/msystems.01661-24.f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/8a9824452ef1/msystems.01661-24.f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/fb9c4fe308d0/msystems.01661-24.f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/c57aa3e9504e/msystems.01661-24.f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/a2f918f3e08b/msystems.01661-24.f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/8b35dd5243de/msystems.01661-24.f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/1f8e54d99059/msystems.01661-24.f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/8a9824452ef1/msystems.01661-24.f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2edd/11834439/fb9c4fe308d0/msystems.01661-24.f006.jpg

相似文献

1
Exploration of the genetic landscape of bacterial dsDNA viruses reveals an ANI gap amid extensive mosaicism.对细菌双链DNA病毒基因图谱的探索揭示了广泛镶嵌现象中的ANI差距。
mSystems. 2025 Feb 18;10(2):e0166124. doi: 10.1128/msystems.01661-24. Epub 2025 Jan 29.
2
A natural ANI gap that can define intra-species units of bacteriophages and other viruses.一个可以定义噬菌体和其他病毒种内单位的自然平均核苷酸差异。
mBio. 2024 Aug 14;15(8):e0153624. doi: 10.1128/mbio.01536-24. Epub 2024 Jul 22.
3
Uncovering the boundaries of species through large-scale phylogenetic and nucleotide identity analyses.通过大规模的系统发育和核苷酸同源性分析揭示种的界限。
mSystems. 2024 Apr 16;9(4):e0121823. doi: 10.1128/msystems.01218-23. Epub 2024 Mar 26.
4
Whole-proteome phylogeny of large dsDNA viruses and parvoviruses through a composition vector method related to dynamical language model.通过与动态语言模型相关的组合向量方法对大型双链 DNA 病毒和细小病毒进行全蛋白质组系统发育分析。
BMC Evol Biol. 2010 Jun 22;10:192. doi: 10.1186/1471-2148-10-192.
5
High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.高通量 ANI 分析 9 万余组原核基因组揭示了清晰的物种界限。
Nat Commun. 2018 Nov 30;9(1):5114. doi: 10.1038/s41467-018-07641-9.
6
The Double-Stranded DNA Virosphere as a Modular Hierarchical Network of Gene Sharing.作为基因共享模块化层次网络的双链DNA病毒圈
mBio. 2016 Aug 2;7(4):e00978-16. doi: 10.1128/mBio.00978-16.
7
A large-scale evaluation of algorithms to calculate average nucleotide identity.计算平均核苷酸一致性的算法的大规模评估。
Antonie Van Leeuwenhoek. 2017 Oct;110(10):1281-1286. doi: 10.1007/s10482-017-0844-4. Epub 2017 Feb 15.
8
Bioinformatic genome comparisons for taxonomic and phylogenetic assignments using Aeromonas as a test case.以气单胞菌为测试案例,进行用于分类学和系统发育分析的生物信息学基因组比较。
mBio. 2014 Nov 18;5(6):e02136. doi: 10.1128/mBio.02136-14.
9
LINflow: a computational pipeline that combines an alignment-free with an alignment-based method to accelerate generation of similarity matrices for prokaryotic genomes.LINflow:一种计算流程,它将一种无比对方法与一种基于比对的方法相结合,以加速原核生物基因组相似性矩阵的生成。
PeerJ. 2021 Mar 24;9:e10906. doi: 10.7717/peerj.10906. eCollection 2021.
10
Using average nucleotide identity (ANI) to evaluate microsporidia species boundaries based on their genetic relatedness.利用平均核苷酸同一性(ANI),根据微孢子虫的遗传相关性评估其物种界限。
J Eukaryot Microbiol. 2023 Mar;70(2):e12944. doi: 10.1111/jeu.12944. Epub 2022 Sep 19.

引用本文的文献

1
EvANI benchmarking workflow for evolutionary distance estimation.用于进化距离估计的EvANI基准测试工作流程。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf267.
2
Deep learning-guided structural analysis of a novel bacteriophage KPP105 against multidrug-resistant .针对多重耐药菌的新型噬菌体KPP105的深度学习引导的结构分析
Comput Struct Biotechnol J. 2025 May 1;27:1827-1837. doi: 10.1016/j.csbj.2025.04.032. eCollection 2025.

本文引用的文献

1
A natural ANI gap that can define intra-species units of bacteriophages and other viruses.一个可以定义噬菌体和其他病毒种内单位的自然平均核苷酸差异。
mBio. 2024 Aug 14;15(8):e0153624. doi: 10.1128/mbio.01536-24. Epub 2024 Jul 22.
2
Sequence-discrete species for prokaryotes and other microbes: A historical perspective and pending issues.原核生物和其他微生物的序列离散物种:历史视角与待解决问题
mLife. 2023 Dec 11;2(4):341-349. doi: 10.1002/mlf2.12088. eCollection 2023 Dec.
3
Evolution of homologous recombination rates across bacteria.细菌中同源重组率的演变。
Proc Natl Acad Sci U S A. 2024 Apr 30;121(18):e2316302121. doi: 10.1073/pnas.2316302121. Epub 2024 Apr 24.
4
YACHT: an ANI-based statistical test to detect microbial presence/absence in a metagenomic sample.YACHT:一种基于ANI 的统计测试,用于检测宏基因组样本中的微生物存在/缺失。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae047.
5
An ANI gap within bacterial species that advances the definitions of intra-species units.种内 ANI 差距推进了种内单位的定义。
mBio. 2024 Jan 16;15(1):e0269623. doi: 10.1128/mbio.02696-23. Epub 2023 Dec 12.
6
Fast and robust metagenomic sequence comparison through sparse chaining with skani.通过使用 skani 进行稀疏链接实现快速稳健的宏基因组序列比较。
Nat Methods. 2023 Nov;20(11):1661-1665. doi: 10.1038/s41592-023-02018-3. Epub 2023 Sep 21.
7
Four principles to establish a universal virus taxonomy.建立通用病毒分类学的四个原则。
PLoS Biol. 2023 Feb 13;21(2):e3001922. doi: 10.1371/journal.pbio.3001922. eCollection 2023 Feb.
8
Gene flow and introgression are pervasive forces shaping the evolution of bacterial species.基因流和基因渗入是塑造细菌物种进化的普遍力量。
Genome Biol. 2022 Nov 10;23(1):239. doi: 10.1186/s13059-022-02809-5.
9
INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes.噬菌体参考数据库的基础设施:识别当前培养噬菌体基因组集合中的大规模偏差
Phage (New Rochelle). 2021 Dec 1;2(4):214-223. doi: 10.1089/phage.2021.0007. Epub 2021 Dec 16.
10
ANI analysis of poxvirus genomes reveals its potential application to viral species rank demarcation.痘病毒基因组的ANI分析揭示了其在病毒物种等级划分中的潜在应用。
Virus Evol. 2022 May 28;8(1):veac031. doi: 10.1093/ve/veac031. eCollection 2022.