• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用uvaia进行可扩展的邻域搜索和比对。

Scalable neighbour search and alignment with uvaia.

作者信息

de Oliveira Martins Leonardo, Mather Alison E, Page Andrew J

机构信息

Quadram Institute Bioscience, Norwich, United Kingdom.

University of East Anglia, Norwich, United Kingdom.

出版信息

PeerJ. 2024 Mar 6;12:e16890. doi: 10.7717/peerj.16890. eCollection 2024.

DOI:10.7717/peerj.16890
PMID:38464752
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10924453/
Abstract

Despite millions of SARS-CoV-2 genomes being sequenced and shared globally, manipulating such data sets is still challenging, especially selecting sequences for focused phylogenetic analysis. We present a novel method, uvaia, which is based on partial and exact sequence similarity for quickly extracting database sequences similar to query sequences of interest. Many SARS-CoV-2 phylogenetic analyses rely on very low numbers of ambiguous sites as a measure of quality since ambiguous sites do not contribute to single nucleotide polymorphism (SNP) differences. Uvaia overcomes this limitation by using measures of sequence similarity which consider partially ambiguous sites, allowing for more ambiguous sequences to be included in the analysis if needed. Such fine-grained definition of similarity allows not only for better phylogenetic analyses, but could also lead to improved classification and biogeographical inferences. Uvaia works natively with compressed files, can use multiple cores and efficiently utilises memory, being able to analyse large data sets on a standard desktop.

摘要

尽管全球已对数百万个严重急性呼吸综合征冠状病毒2(SARS-CoV-2)基因组进行了测序和共享,但处理此类数据集仍然具有挑战性,尤其是为重点系统发育分析选择序列。我们提出了一种新方法——uvaia,它基于部分和精确序列相似性,用于快速从数据库中提取与感兴趣的查询序列相似的序列。许多SARS-CoV-2系统发育分析依赖于非常少量的模糊位点作为质量衡量标准,因为模糊位点对单核苷酸多态性(SNP)差异没有贡献。uvaia通过使用考虑部分模糊位点的序列相似性度量克服了这一限制,从而在需要时允许将更多模糊序列纳入分析。这种对相似性的细粒度定义不仅有助于进行更好的系统发育分析,还可能改善分类和生物地理学推断。uvaia原生支持处理压缩文件,可以使用多个内核并高效利用内存,能够在标准桌面上分析大型数据集。

相似文献

1
Scalable neighbour search and alignment with uvaia.使用uvaia进行可扩展的邻域搜索和比对。
PeerJ. 2024 Mar 6;12:e16890. doi: 10.7717/peerj.16890. eCollection 2024.
2
Taxonium, a web-based tool for exploring large phylogenetic trees.Taxonium,一个用于探索大型系统发育树的网络工具。
Elife. 2022 Nov 15;11:e82392. doi: 10.7554/eLife.82392.
3
Inferring the genetic variability in Indian SARS-CoV-2 genomes using consensus of multiple sequence alignment techniques.利用多重序列比对技术共识推断印度 SARS-CoV-2 基因组中的遗传变异。
Infect Genet Evol. 2020 Nov;85:104522. doi: 10.1016/j.meegid.2020.104522. Epub 2020 Sep 1.
4
Online Phylogenetics with matOptimize Produces Equivalent Trees and is Dramatically More Efficient for Large SARS-CoV-2 Phylogenies than de novo and Maximum-Likelihood Implementations.在线系统发育学与 matOptimize 产生等效的树,并且比从头开始和最大似然实现对大型 SARS-CoV-2 系统发育更有效率。
Syst Biol. 2023 Nov 1;72(5):1039-1051. doi: 10.1093/sysbio/syad031.
5
Sequence analysis for SNP detection and phylogenetic reconstruction of SARS-cov-2 isolated from Nigerian COVID-19 cases.对从尼日利亚新冠肺炎病例中分离出的新冠病毒进行单核苷酸多态性检测及系统发育重建的序列分析。
New Microbes New Infect. 2022 Jan;45:100955. doi: 10.1016/j.nmni.2022.100955. Epub 2022 Jan 18.
6
Comprehensive comparative genomic and microsatellite analysis of SARS, MERS, BAT-SARS, and COVID-19 coronaviruses.全面比较 SARS、MERS、BAT-SARS 和 COVID-19 冠状病毒的基因组和微卫星分析。
J Med Virol. 2021 Jul;93(7):4382-4391. doi: 10.1002/jmv.26974. Epub 2021 Apr 8.
7
DecentTree: scalable Neighbour-Joining for the genomic era.DecentTree:基因组时代可扩展的近邻连接算法。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad536.
8
A new profiling approach for DNA sequences based on the nucleotides' physicochemical features for accurate analysis of SARS-CoV-2 genomes.一种基于核苷酸理化特征的 DNA 序列新分析方法,可准确分析 SARS-CoV-2 基因组。
BMC Genomics. 2023 May 18;24(1):266. doi: 10.1186/s12864-023-09373-7.
9
Stability of SARS-CoV-2 phylogenies.SARS-CoV-2 系统发育的稳定性。
PLoS Genet. 2020 Nov 18;16(11):e1009175. doi: 10.1371/journal.pgen.1009175. eCollection 2020 Nov.
10
Whole genome analysis of more than 10 000 SARS-CoV-2 virus unveils global genetic diversity and target region of NSP6.对超过 10000 株 SARS-CoV-2 病毒进行全基因组分析揭示了全球遗传多样性和 NSP6 的靶标区域。
Brief Bioinform. 2021 Mar 22;22(2):1106-1121. doi: 10.1093/bib/bbab025.

引用本文的文献

1
Temporal dynamics of SARS-CoV-2 shedding in feces and saliva: a longitudinal study in Norfolk, United Kingdom during the 2021-2022 COVID-19 waves.严重急性呼吸综合征冠状病毒2(SARS-CoV-2)在粪便和唾液中的排出时间动态:2021 - 2022年新冠疫情期间在英国诺福克郡的一项纵向研究。
Microbiol Spectr. 2025 Mar 25;13(5):e0319524. doi: 10.1128/spectrum.03195-24.
2
A regional genomic surveillance program is implemented to monitor the occurrence and emergence of SARS-CoV-2 variants in Yubei District, China.中国渝北区实施了区域基因组监测计划,以监测 SARS-CoV-2 变异株的发生和出现。
Virol J. 2024 Jan 8;21(1):13. doi: 10.1186/s12985-023-02279-6.

本文引用的文献

1
Repurposing an integrated national influenza platform for genomic surveillance of SARS-CoV-2 in Ghana: a molecular epidemiological analysis.重新利用综合国家流感平台对加纳的 SARS-CoV-2 进行基因组监测:分子流行病学分析。
Lancet Glob Health. 2023 Jul;11(7):e1075-e1085. doi: 10.1016/S2214-109X(23)00189-4.
2
Online Phylogenetics with matOptimize Produces Equivalent Trees and is Dramatically More Efficient for Large SARS-CoV-2 Phylogenies than de novo and Maximum-Likelihood Implementations.在线系统发育学与 matOptimize 产生等效的树,并且比从头开始和最大似然实现对大型 SARS-CoV-2 系统发育更有效率。
Syst Biol. 2023 Nov 1;72(5):1039-1051. doi: 10.1093/sysbio/syad031.
3
Genomics-informed outbreak investigations of SARS-CoV-2 using civet.
利用果子狸对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)进行基于基因组学的疫情调查。
PLOS Glob Public Health. 2022 Dec 9;2(12):e0000704. doi: 10.1371/journal.pgph.0000704. eCollection 2022.
4
Dynamics of competing SARS-CoV-2 variants during the Omicron epidemic in England.奥密克戎疫情期间英国竞争 SARS-CoV-2 变体的动态。
Nat Commun. 2022 Jul 28;13(1):4375. doi: 10.1038/s41467-022-32096-4.
5
SARS-CoV-2 lineage dynamics in England from September to November 2021: high diversity of Delta sub-lineages and increased transmissibility of AY.4.2.2021 年 9 月至 11 月期间英国 SARS-CoV-2 谱系动态:Delta 亚谱系多样性高,AY.4.2 传播能力增强。
BMC Infect Dis. 2022 Jul 27;22(1):647. doi: 10.1186/s12879-022-07628-4.
6
Replacement of the Alpha variant of SARS-CoV-2 by the Delta variant in Lebanon between April and June 2021.2021 年 4 月至 6 月期间,黎巴嫩出现的 SARS-CoV-2 的阿尔法变异株被德尔塔变异株取代。
Microb Genom. 2022 Jul;8(7). doi: 10.1099/mgen.0.000838.
7
distAngsd: Fast and Accurate Inference of Genetic Distances for Next-Generation Sequencing Data.distAngsd:用于下一代测序数据的快速准确的遗传距离推断。
Mol Biol Evol. 2022 Jun 2;39(6). doi: 10.1093/molbev/msac119.
8
Variation at Spike position 142 in SARS-CoV-2 Delta genomes is a technical artifact caused by dropout of a sequencing amplicon.严重急性呼吸综合征冠状病毒2(SARS-CoV-2)德尔塔基因组中刺突蛋白第142位的变异是由测序扩增子缺失导致的技术假象。
Wellcome Open Res. 2021 Nov 10;6:305. doi: 10.12688/wellcomeopenres.17295.1. eCollection 2021.
9
Genomic assessment of quarantine measures to prevent SARS-CoV-2 importation and transmission.基因组评估检疫措施以防止 SARS-CoV-2 的输入和传播。
Nat Commun. 2022 Feb 23;13(1):1012. doi: 10.1038/s41467-022-28371-z.
10
Genomic Surveillance for SARS-CoV-2 Variants: Predominance of the Delta (B.1.617.2) and Omicron (B.1.1.529) Variants - United States, June 2021-January 2022.SARS-CoV-2 变体的基因组监测:Delta(B.1.617.2)和奥密克戎(B.1.1.529)变体占主导地位-美国,2021 年 6 月-2022 年 1 月。
MMWR Morb Mortal Wkly Rep. 2022 Feb 11;71(6):206-211. doi: 10.15585/mmwr.mm7106a4.