• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SNPPar:从微生物全基因组比对中识别趋同进化和其他同型现象。

SNPPar: identifying convergent evolution and other homoplasies from microbial whole-genome alignments.

机构信息

Department of Infectious Diseases, Central Clinical School, Monash University, Melbourne, Victoria, Australia.

Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria, Australia.

出版信息

Microb Genom. 2021 Dec;7(12). doi: 10.1099/mgen.0.000694.

DOI:10.1099/mgen.0.000694
PMID:34874243
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8767352/
Abstract

Homoplasic SNPs are considered important signatures of strong (positive) selective pressure, and hence of adaptive evolution for clinically relevant traits such as antibiotic resistance and virulence. Here we present a new tool, SNPPar, for efficient detection and analysis of homoplasic SNPs from large whole genome sequencing datasets (>1000 isolates and/or >100 000 SNPs). SNPPar takes as input an SNP alignment, tree and annotated reference genome, and uses a combination of simple monophyly tests and ancestral state reconstruction (ASR, via TreeTime) to assign mutation events to branches and identify homoplasies. Mutations are annotated at the level of codon and gene, to facilitate analysis of convergent evolution. Testing on simulated data (120 alignments representing local and global samples) showed SNPPar can detect homoplasic SNPs with very high specificity (zero false-positives in all tests) and high sensitivity (zero false-negatives in 89 % of tests). SNPPar analysis of three empirically sampled datasets (, and ) produced results that were in concordance with previous studies, in terms of both individual homoplasies and evidence of convergence at the codon and gene levels. SNPPar analysis of a simulated alignment of ~64 000 genome-wide SNPs from 2000 genomes took ~23 min and ~2.6 GB of RAM to generate complete annotated results on a laptop. This analysis required ASR be conducted for only 1.25 % of SNPs, and the ASR step took ~23 s and 0.4 GB of RAM. SNPPar automates the detection and annotation of homoplasic SNPs efficiently and accurately from large SNP alignments. As demonstrated by the examples included here, this information can be readily used to explore the role of homoplasy in parallel and/or convergent evolution at the level of nucleotide, codon and/or gene.

摘要

同态 SNP 被认为是强(正)选择压力的重要特征,因此也是与临床相关特征(如抗生素耐药性和毒力)相关的适应性进化的重要特征。在这里,我们提出了一种新的工具 SNPPar,用于从大型全基因组测序数据集(>1000 个分离株和/或>100000 个 SNPs)中高效检测和分析同态 SNP。SNPPar 以 SNP 比对、树和注释的参考基因组作为输入,使用简单的单系测试和祖先状态重建(通过 TreeTime)的组合,将突变事件分配给分支并识别同态性。突变在密码子和基因水平上进行注释,以方便对趋同进化进行分析。在模拟数据(代表局部和全局样本的 120 个比对)上的测试表明,SNPPar 可以以非常高的特异性(所有测试均无假阳性)和高灵敏度(89%的测试均无假阴性)检测同态 SNP。对三个经验采样数据集(1000 个细菌基因组、1000 个结核分枝杆菌基因组和 33 个淋病奈瑟氏球菌基因组)的 SNPPar 分析结果与以前的研究一致,无论是在个体同态性方面,还是在密码子和基因水平上的趋同证据方面。对来自 2000 个基因组的约 64000 个全基因组 SNP 的模拟比对进行的 SNPPar 分析,在笔记本电脑上生成完整注释结果需要约 23 分钟和 2.6GB 的 RAM。此分析仅需要对 1.25%的 SNPs 进行 ASR,ASR 步骤需要约 23 秒和 0.4GB 的 RAM。SNPPar 可以有效地从大型 SNP 比对中自动检测和注释同态 SNP,并且准确无误。正如这里包含的示例所示,这些信息可以很容易地用于探索同态性在核苷酸、密码子和/或基因水平上的平行和/或趋同进化中的作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/00edae297c07/mgen-7-0694-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/d1872b1f2c89/mgen-7-0694-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/b8ce5d285588/mgen-7-0694-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/f8071d2f87d4/mgen-7-0694-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/0749f72c40f2/mgen-7-0694-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/dcef7a613262/mgen-7-0694-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/9bfd18338f9f/mgen-7-0694-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/508cedb0d25c/mgen-7-0694-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/3b0af0592035/mgen-7-0694-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/00edae297c07/mgen-7-0694-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/d1872b1f2c89/mgen-7-0694-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/b8ce5d285588/mgen-7-0694-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/f8071d2f87d4/mgen-7-0694-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/0749f72c40f2/mgen-7-0694-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/dcef7a613262/mgen-7-0694-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/9bfd18338f9f/mgen-7-0694-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/508cedb0d25c/mgen-7-0694-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/3b0af0592035/mgen-7-0694-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce2/8767352/00edae297c07/mgen-7-0694-g009.jpg

相似文献

1
SNPPar: identifying convergent evolution and other homoplasies from microbial whole-genome alignments.SNPPar:从微生物全基因组比对中识别趋同进化和其他同型现象。
Microb Genom. 2021 Dec;7(12). doi: 10.1099/mgen.0.000694.
2
QuantTB - a method to classify mixed Mycobacterium tuberculosis infections within whole genome sequencing data.QuantTB - 一种在全基因组测序数据中分类混合结核分枝杆菌感染的方法。
BMC Genomics. 2020 Jan 28;21(1):80. doi: 10.1186/s12864-020-6486-3.
3
When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.当全基因组比对无法奏效时:用于数百个微生物基因组的无比对单核苷酸多态性(SNP)发现及系统发育分析的kSNP v2软件
PLoS One. 2013 Dec 9;8(12):e81760. doi: 10.1371/journal.pone.0081760. eCollection 2013.
4
Large-scale genomic analysis shows association between homoplastic genetic variation in Mycobacterium tuberculosis genes and meningeal or pulmonary tuberculosis.大规模基因组分析显示,结核分枝杆菌基因中的同形遗传变异与脑膜或肺结核之间存在关联。
BMC Genomics. 2018 Feb 5;19(1):122. doi: 10.1186/s12864-018-4498-z.
5
Convergent evolutionary analysis identifies significant mutations in drug resistance targets of Mycobacterium tuberculosis.趋同进化分析确定了结核分枝杆菌耐药靶点中的显著突变。
Antimicrob Agents Chemother. 2008 Sep;52(9):3369-76. doi: 10.1128/AAC.00309-08. Epub 2008 Jun 30.
6
Whole-genome sequencing and single nucleotide polymorphisms in multidrug-resistant clinical isolates of Mycobacterium tuberculosis from the Philippines.菲律宾耐多药结核分枝杆菌临床分离株的全基因组测序和单核苷酸多态性。
J Glob Antimicrob Resist. 2018 Dec;15:239-245. doi: 10.1016/j.jgar.2018.08.009. Epub 2018 Aug 18.
7
A Comprehensive Map of Mycobacterium tuberculosis Complex Regions of Difference.结核分枝杆菌复合体区域差异综合图谱。
mSphere. 2021 Aug 25;6(4):e0053521. doi: 10.1128/mSphere.00535-21. Epub 2021 Jul 21.
8
Comparative genomic analysis of Mycobacterium tuberculosis clinical isolates.结核分枝杆菌临床分离株的比较基因组分析。
BMC Genomics. 2014 Jun 13;15(1):469. doi: 10.1186/1471-2164-15-469.
9
Revisiting the taxonomy of the genus Elizabethkingia using whole-genome sequencing, optical mapping, and MALDI-TOF, along with proposal of three novel Elizabethkingia species: Elizabethkingia bruuniana sp. nov., Elizabethkingia ursingii sp. nov., and Elizabethkingia occulta sp. nov.利用全基因组测序、光学图谱和基质辅助激光解吸电离飞行时间质谱重新审视伊丽莎白菌属的分类,并提出三个新的伊丽莎白菌种:布鲁恩伊丽莎白菌(Elizabethkingia bruuniana)新种、乌尔辛伊丽莎白菌(Elizabethkingia ursingii)新种和隐匿伊丽莎白菌(Elizabethkingia occulta)新种。
Antonie Van Leeuwenhoek. 2018 Jan;111(1):55-72. doi: 10.1007/s10482-017-0926-3. Epub 2017 Aug 30.
10
Construction of a virtual Mycobacterium tuberculosis consensus genome and its application to data from a next generation sequencer.结核分枝杆菌虚拟共有基因组的构建及其在新一代测序仪数据中的应用。
BMC Genomics. 2015 Mar 20;16(1):218. doi: 10.1186/s12864-015-1368-9.

引用本文的文献

1
Wave succession in the pandemic clone of Vibrio parahaemolyticus driven by gene loss.基因缺失驱动的副溶血性弧菌大流行克隆中的波状演替
Nat Ecol Evol. 2025 Aug 27. doi: 10.1038/s41559-025-02827-z.
2
Evaluating selection at intermediate scales within genes provides robust identification of genes under positive selection in clinical isolates.评估基因内部中等尺度的选择情况,能够在临床分离株中可靠地鉴定出受到正选择的基因。
bioRxiv. 2025 May 20:2025.05.07.652684. doi: 10.1101/2025.05.07.652684.
3
Evolution of pathogenic Escherichia coli harboring the transmissible locus of stress tolerance: from food sources to clinical environments.

本文引用的文献

1
IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era.IQ-TREE 2:基因组时代系统发育推断的新模型和有效方法。
Mol Biol Evol. 2020 May 1;37(5):1530-1534. doi: 10.1093/molbev/msaa015.
2
Use of Whole-Genome Sequencing for Food Safety and Public Health in the United States.全基因组测序在美国的食品安全和公共卫生中的应用。
Foodborne Pathog Dis. 2019 Jul;16(7):441-450. doi: 10.1089/fpd.2019.2662. Epub 2019 Jun 13.
3
RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference.
携带应激耐受可传递位点的致病性大肠杆菌的进化:从食物来源到临床环境。
Sci Rep. 2025 Feb 11;15(1):5014. doi: 10.1038/s41598-025-89066-1.
4
Genomic perspective on the bacillus causing paratyphoid B fever.引起副伤寒B热的杆菌的基因组视角。
Nat Commun. 2024 Dec 10;15(1):10143. doi: 10.1038/s41467-024-54418-4.
5
Persistent Salmonella enterica serovar Typhi sub-populations within host interrogated by whole genome sequencing and metagenomics.全基因组测序和宏基因组学检测宿主内持续存在的伤寒沙门氏菌血清型 Typhi 亚群。
PLoS One. 2023 Aug 23;18(8):e0289070. doi: 10.1371/journal.pone.0289070. eCollection 2023.
6
Phase variation as a major mechanism of adaptation in complex.相变异作为复杂适应的主要机制。
Proc Natl Acad Sci U S A. 2023 Jul 11;120(28):e2301394120. doi: 10.1073/pnas.2301394120. Epub 2023 Jul 3.
7
Analysis of Genome-Wide Mutational Dependence in Naturally Evolving Mycobacterium tuberculosis Populations.全基因组突变依赖性分析在自然进化的结核分枝杆菌群体中。
Mol Biol Evol. 2023 Jun 1;40(6). doi: 10.1093/molbev/msad131.
8
Mutation rates and adaptive variation among the clinically dominant clusters of .在临床上占主导地位的 群集中的突变率和适应性变异。
Proc Natl Acad Sci U S A. 2023 May 30;120(22):e2302033120. doi: 10.1073/pnas.2302033120. Epub 2023 May 22.
9
The clinical features and genomic epidemiology of carbapenem-resistant Acinetobacter baumannii infections at a tertiary hospital in Vietnam.越南一家三级医院耐碳青霉烯鲍曼不动杆菌感染的临床特征和基因组流行病学。
J Glob Antimicrob Resist. 2023 Jun;33:267-275. doi: 10.1016/j.jgar.2023.04.007. Epub 2023 Apr 27.
10
Integrative Reverse Genetic Analysis Identifies Polymorphisms Contributing to Decreased Antimicrobial Agent Susceptibility in .综合反向遗传学分析鉴定出导致 对抗菌药物敏感性降低的多态性。
mBio. 2022 Feb 22;13(1):e0361821. doi: 10.1128/mbio.03618-21. Epub 2022 Jan 18.
RAxML-NG:用于最大似然系统发育推断的快速、可扩展和用户友好的工具。
Bioinformatics. 2019 Nov 1;35(21):4453-4455. doi: 10.1093/bioinformatics/btz305.
4
Interactive Tree Of Life (iTOL) v4: recent updates and new developments.交互式生命树 (iTOL) v4:最新更新和新发展。
Nucleic Acids Res. 2019 Jul 2;47(W1):W256-W259. doi: 10.1093/nar/gkz239.
5
HomoplasyFinder: a simple tool to identify homoplasies on a phylogeny.同形异源发生识别器:一个用于在系统发育树上识别同形异源发生的简单工具。
Microb Genom. 2019 Jan;5(1). doi: 10.1099/mgen.0.000245. Epub 2019 Jan 21.
6
Recent advances in computational phylodynamics.计算系统发育动力学的最新进展。
Curr Opin Virol. 2018 Aug;31:24-32. doi: 10.1016/j.coviro.2018.08.009. Epub 2018 Sep 22.
7
Frequent transmission of the Mycobacterium tuberculosis Beijing lineage and positive selection for the EsxW Beijing variant in Vietnam.越南频繁传播结核分枝杆菌北京家族和对北京型 EsxW 的正向选择。
Nat Genet. 2018 Jun;50(6):849-856. doi: 10.1038/s41588-018-0117-9. Epub 2018 May 21.
8
Evaluation of phylogenetic reconstruction methods using bacterial whole genomes: a simulation based study.使用细菌全基因组评估系统发育重建方法:一项基于模拟的研究
Wellcome Open Res. 2018 Mar 23;3:33. doi: 10.12688/wellcomeopenres.14265.2. eCollection 2018.
9
Genome-wide analysis of multi- and extensively drug-resistant Mycobacterium tuberculosis.全基因组分析多药和广泛耐药结核分枝杆菌。
Nat Genet. 2018 Feb;50(2):307-316. doi: 10.1038/s41588-017-0029-0. Epub 2018 Jan 22.
10
TreeTime: Maximum-likelihood phylodynamic analysis.TreeTime:最大似然系统发育动力学分析。
Virus Evol. 2018 Jan 8;4(1):vex042. doi: 10.1093/ve/vex042. eCollection 2018 Jan.